pith. sign in

arxiv: 2604.06089 · v1 · submitted 2026-04-07 · 📡 eess.SY · cs.SY· math.OC

Coalitional Zero-Sum Games for {H_(infty)} Leader-Following Consensus Control

Pith reviewed 2026-05-10 19:32 UTC · model grok-4.3

classification 📡 eess.SY cs.SYmath.OC
keywords multi-agent systemsleader-following consensusH-infinity controlzero-sum gamesdistributed controlalgebraic Riccati equationadversarial attacksdynamic average consensus
0
0 comments X

The pith

Formulating multi-agent leader-following consensus under adversarial attacks as a coalitional zero-sum game yields a distributed H∞ control law.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper casts the leader-following consensus problem for multi-agent systems facing external adversarial inputs as a global coalitional min-max zero-sum game in which the agents jointly minimize a shared cost while the attacks maximize it. The optimal game policy produces an H∞ robust controller that attenuates the disturbances. To implement this without a central processor, the authors decompose the high-dimensional generalized algebraic Riccati equation into several identical lower-dimensional equations and apply a dynamic average consensus algorithm to remove the remaining coupling in the control law. A sympathetic reader would care because the method lets a network of vehicles or robots maintain formation or agreement even when some members are under attack, using only local computations.

Core claim

The paper claims that the robust leader-following control problem under adversarial attacks can be solved by formulating it as a coalitional zero-sum differential game whose solution is an H∞ controller, and that the associated high-dimensional GARE can be decomposed into uniform lower-dimensional GAREs whose solutions are combined via a dynamic average consensus algorithm to obtain a fully distributed control law.

What carries the argument

The coalitional min-max zero-sum game whose value function satisfies the high-dimensional GARE; this GARE is decomposed into lower-dimensional copies whose coupling is resolved by dynamic average consensus.

If this is right

  • The game-derived policy guarantees H∞ performance against disturbances modeled as attacks.
  • Each agent computes its control input using only local state information after the decomposition and consensus steps.
  • The method applies directly to formation control of multi-vehicle systems whose dynamics have been feedback-linearized.
  • Distributed implementation removes the need for a central node to solve or store the full high-dimensional Riccati equation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the exact decomposition holds, the same game-theoretic reduction could be applied to other network disturbance-rejection tasks that currently require centralized Riccati solutions.
  • The dynamic average consensus step may produce transient mismatch during finite-time convergence, so performance bounds would need explicit finite-time analysis.
  • The linear-system assumption leaves open whether the decomposition and consensus technique extend to nonlinear agent dynamics without losing the H∞ guarantee.

Load-bearing premise

The high-dimensional GARE can be split into multiple uniform lower-dimensional GAREs while exactly preserving the optimality and robustness properties of the original centralized solution.

What would settle it

A side-by-side simulation on the same multi-vehicle formation where the distributed control inputs and closed-loop trajectories from the decomposed GAREs plus consensus differ measurably from those produced by solving the full centralized GARE.

Figures

Figures reproduced from arXiv: 2604.06089 by Dingguo Liang, Yuezu Lv, Yunxiao Ren, Zhisheng Duan.

Figure 1
Figure 1. Figure 1: Conceptual diagram of the global coalitional min-max zero-sum game framework for multi-agent systems, illustrating the structural game relationship [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 3
Figure 3. Figure 3: Schematic of the ith vehicle. of vehicle i. Similar to [29], the dynamics of the ith vehicle can be reformulated as follows by feedback linearization at a fixed reference point qi = [qx,i, qy,i] T off the center of the vehicle. d dt  qi q˙i  =  02 I2 02 02   qi q˙i  +  02 I2  ui +  02 I2  wi , (58) where ui = [ux,i uy,i] T is the control input, wi = [wx,i wy,i] T is the energy-bounded attack inpu… view at source ↗
Figure 4
Figure 4. Figure 4: The convergence of Algorithm 1. -20 -15 -10 -5 0 5 -5 0 5 10 vehicle1 vehicle2 vehicle3 vehicle4 vehicle5 vehicle6 vehicle7 vehicle8 [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The trajectories of 8 vehicles. Fig.4 shows the consensys error of Algorithm 1, the result demonstrates that Algorithm can converge very fast meet the decoupling requirements. Fig.5 illustrates the trajectories of 8 vehicles, it can be obatained that the 8 vehicles can effectively form a formatiion and transform the formation by reset their relative positions with the leader. The convergence result of the … view at source ↗
Figure 8
Figure 8. Figure 8: The worst-cast energy-bounded attack inputs of 8 vehicles. [PITH_FULL_IMAGE:figures/full_fig_p009_8.png] view at source ↗
read the original abstract

This paper investigates the leader-following consensus problem for a class of multi-agent systems subject to adversarial attack-like external inputs. To address this, we formulate the robust leader-following control problem as a global coalitional min-max zero-sum game using differential game theory. Specifically, the agents' control inputs form a coalition to minimize a global cost function, while the attacks form an opposing coalition to maximize it. Notably, when these external adversarial attacks manifest as disturbances, the designed game-theoretic control policy systematically yields a robust $H_\infty$ control law. Addressing this problem inherently requires solving a high-dimensional generalized algebraic Riccati equation (GARE), which poses significant challenges for distributed computation and controller implementation. To overcome these challenges, we propose a two-fold approach. First, a decentralized computational strategy is devised to decompose the high-dimensional GARE into multiple uniform, lower-dimensional GAREs. Second, a dynamic average consensus-based decoupling algorithm is developed to resolve the inherent coupling structure of the robust control law, thereby facilitating its distributed implementation. Finally, numerical simulations on the formation control of multi-vehicle systems with feedback-linearized dynamics are conducted to validate the effectiveness of the proposed algorithms.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper formulates the H∞ leader-following consensus problem for multi-agent systems under adversarial disturbances as a coalitional min-max zero-sum differential game. The resulting high-dimensional generalized algebraic Riccati equation (GARE) is decomposed into multiple uniform lower-dimensional GAREs, and a dynamic average consensus algorithm is used to decouple the control law for distributed implementation. Effectiveness is illustrated via numerical simulations on formation control of feedback-linearized multi-vehicle systems.

Significance. If the decomposition exactly preserves the saddle-point solution and the H∞ performance bound, the approach would offer a practical route to distributed robust control for MAS, mitigating the computational burden of global GAREs while retaining theoretical guarantees. The simulation results on multi-vehicle formation provide concrete evidence of applicability, but the strength hinges on rigorous verification that the distributed law matches the centralized H∞ optimum.

major comments (2)
  1. [the proposed decentralized computational strategy] The central claim that the high-dimensional GARE decomposes into uniform lower-dimensional GAREs while preserving optimality and the H∞ bound (two-fold approach) requires explicit proof. For leader-following MAS the closed-loop dynamics involve the graph Laplacian; uniformity of the decomposed Riccati solutions holds only under restrictive assumptions (identical agents, regular undirected graphs). Without a general proof that the reassembled solution satisfies the original saddle-point condition, the distributed policy may lose the guaranteed disturbance attenuation level.
  2. [the dynamic average consensus-based decoupling algorithm] The dynamic average consensus decoupling step must be shown not to perturb the control input away from the exact H∞ saddle point. Any finite-time estimation error or communication delay introduces a mismatch that can violate the min-max optimality and degrade the closed-loop H∞ norm; the manuscript should quantify the resulting performance loss or provide a stability margin.
minor comments (2)
  1. Notation for the coalitional cost function and the disturbance attenuation level γ should be introduced with explicit definitions before the GARE is stated.
  2. The simulation section would benefit from a direct comparison of the achieved H∞ norm against the centralized solution and against a non-decomposed baseline to quantify any performance gap.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which help strengthen the theoretical foundations of our work. We address each major comment point by point below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: The central claim that the high-dimensional GARE decomposes into uniform lower-dimensional GAREs while preserving optimality and the H∞ bound (two-fold approach) requires explicit proof. For leader-following MAS the closed-loop dynamics involve the graph Laplacian; uniformity of the decomposed Riccati solutions holds only under restrictive assumptions (identical agents, regular undirected graphs). Without a general proof that the reassembled solution satisfies the original saddle-point condition, the distributed policy may lose the guaranteed disturbance attenuation level.

    Authors: We agree that an explicit proof is required to rigorously establish preservation of the saddle-point solution and the H∞ performance bound. The manuscript develops the decomposition specifically for identical agents over undirected regular graphs, where the Laplacian structure enables uniform lower-dimensional GAREs. In the revised version we will insert a new theorem together with a complete proof demonstrating that the reassembled solution satisfies the original min-max saddle-point condition under these assumptions, thereby retaining the guaranteed disturbance attenuation level. We will also clarify the assumptions and note that uniformity need not hold for non-identical agents or directed graphs. revision: yes

  2. Referee: The dynamic average consensus decoupling step must be shown not to perturb the control input away from the exact H∞ saddle point. Any finite-time estimation error or communication delay introduces a mismatch that can violate the min-max optimality and degrade the closed-loop H∞ norm; the manuscript should quantify the resulting performance loss or provide a stability margin.

    Authors: The dynamic average consensus protocol converges asymptotically to the exact average, so the distributed control law converges to the centralized H∞ saddle-point solution. To address finite-time estimation errors and bounded communication delays, the revised manuscript will include a Lyapunov-based analysis that quantifies the resulting deviation from the optimal H∞ norm and supplies an explicit stability margin together with a bound on performance degradation. revision: yes

Circularity Check

0 steps flagged

No circularity: standard differential game theory applied to H∞ formulation; decomposition and consensus steps are constructive proposals, not reductions to inputs or self-citations

full rationale

The derivation begins with a standard formulation of the leader-following consensus problem as a coalitional min-max zero-sum game drawn from differential game theory, which directly yields the H∞ control law via the global GARE as a known consequence of the LQ setup. The subsequent two-fold approach (decomposition of the GARE into uniform lower-dimensional instances and dynamic-average-consensus decoupling) consists of explicitly constructed algorithms whose correctness is asserted via derivation rather than by redefining the target quantity in terms of itself or by load-bearing self-citation. No equation is shown to equal its own fitted parameter or prior result by construction, and the paper does not invoke uniqueness theorems or ansatzes from the authors' own prior work to force the outcome. The chain therefore remains self-contained against external benchmarks of game-theoretic H∞ control.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard assumptions from differential game theory and linear-quadratic H∞ control; no explicit free parameters, new entities, or ad-hoc axioms are introduced in the abstract.

axioms (1)
  • domain assumption The multi-agent system dynamics are linear and the performance index is quadratic.
    Required for the GARE to arise from the min-max game.

pith-pipeline@v0.9.0 · 5522 in / 1247 out tokens · 62190 ms · 2026-05-10T19:32:09.029892+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages

  1. [1]

    Consensus building in multi-vehicle systems with information feedback,

    W. Ren, “Consensus building in multi-vehicle systems with information feedback,” in2006 International Conference on Mechatronics and Automation, pp. 37–42, 2006

  2. [2]

    Finite-frequencyH −/H∞ unknown input observer-based distributed fault detection for multi-agent systems,

    D. Liang, Y . Yang, R. Li, and R. Liu, “Finite-frequencyH −/H∞ unknown input observer-based distributed fault detection for multi-agent systems,”Journal of the Franklin Institute, vol. 358, no. 6, pp. 3258– 3275, 2021

  3. [3]

    Distributed kalman filter with ultimately accurate fused measurement covariance,

    T. Yang, J. Qian, Z. Duan, and Z. Sun, “Distributed kalman filter with ultimately accurate fused measurement covariance,”IEEE Transactions on Automatic Control, pp. 1–16, 2025

  4. [4]

    Consensus of multiagent systems and synchronization of complex networks: A unified viewpoint,

    Z. Li, Z. Duan, G. Chen, and L. Huang, “Consensus of multiagent systems and synchronization of complex networks: A unified viewpoint,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 57, no. 1, pp. 213–224, 2009

  5. [5]

    Pinning synchronization of directed networks with switching topologies: A multiple lyapunov func- tions approach,

    G. Wen, W. Yu, G. Hu, J. Cao, and X. Yu, “Pinning synchronization of directed networks with switching topologies: A multiple lyapunov func- tions approach,”IEEE Transactions on Neural Networks and Learning Systems, vol. 26, no. 12, pp. 3239–3250, 2015

  6. [6]

    Optimal consensus control strategy for multi-agent systems under cyber attacks via a stackelberg game approach,

    P. Yu, Y . Hu, Y . Wang, R. Jia, and J. Guo, “Optimal consensus control strategy for multi-agent systems under cyber attacks via a stackelberg game approach,”IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 18875–18888, 2025

  7. [7]

    Multitime scale consensus algorithm of multiagent systems with binary-valued data under tampering attacks,

    R. Jia, T. Wang, W. Xue, J. Guo, and Y . Zhao, “Multitime scale consensus algorithm of multiagent systems with binary-valued data under tampering attacks,”IEEE Transactions on Industrial Informatics, vol. 21, no. 12, pp. 9377–9388, 2025

  8. [8]

    Bas ¸ar and P

    T. Bas ¸ar and P. Bernhard,H-infinity optimal control and related minimax design problems: a dynamic game approach. Springer Science & Business Media, 2008

  9. [9]

    K. Zhou, J. C. Doyle, K. Glover,et al.,Robust and optimal control, vol. 40. Prentice hall New Jersey, 1996

  10. [10]

    H ∞ control of networked multi-agent systems,

    Z. Li, Z. Duan, and L. Huang, “H ∞ control of networked multi-agent systems,”Journal of Systems Science and Complexity, vol. 22, p. 35–48, 2009

  11. [11]

    RobustH ∞ consensus control of uncertain multi- agent systems with time delays,

    Y . Liu and Y . Jia, “RobustH ∞ consensus control of uncertain multi- agent systems with time delays,”International Journal of Control, Automation and Systems, vol. 9, no. 6, pp. 1086–1094, 2011

  12. [12]

    H∞ consensus control of multi-agent systems with switching topology: a dynamic output feedback protocol,

    Y . Liu and Y . Jia, “H∞ consensus control of multi-agent systems with switching topology: a dynamic output feedback protocol,”International Journal of Control, vol. 83, no. 3, pp. 527–537, 2010

  13. [13]

    Consensus and itsL 2- gain performance of multi-agent systems with intermittent information transmissions,

    G. Wen, Z. Duan, Z. Li, and G. Chen, “Consensus and itsL 2- gain performance of multi-agent systems with intermittent information transmissions,”International Journal of Control, vol. 85, no. 4, pp. 384– 396, 2012

  14. [14]

    DistributedH ∞ andH 2 consen- sus control in directed networks,

    J. Wang, Z. Duan, Z. Li, and G. Wen, “DistributedH ∞ andH 2 consen- sus control in directed networks,”IET Control Theory & Applications, vol. 8, pp. 193–201(8), February 2014

  15. [15]

    Multi-agent differential graphical games: Online adaptive learning solution for syn- chronization with optimality,

    K. G. Vamvoudakis, F. L. Lewis, and G. R. Hudas, “Multi-agent differential graphical games: Online adaptive learning solution for syn- chronization with optimality,”Automatica, vol. 48, no. 8, pp. 1598–1611, 2012

  16. [16]

    Multi- agent zero-sum differential graphical games for disturbance rejection in distributed control,

    Q. Jiao, H. Modares, S. Xu, F. L. Lewis, and K. G. Vamvoudakis, “Multi- agent zero-sum differential graphical games for disturbance rejection in distributed control,”Automatica, vol. 69, pp. 24–34, 2016

  17. [17]

    Differential graphical games forH ∞ control of linear heterogeneous multiagent systems,

    F. Adib Yaghmaie, K. Hengster Movric, F. L. Lewis, and R. Su, “Differential graphical games forH ∞ control of linear heterogeneous multiagent systems,”International Journal of Robust and Nonlinear Control, vol. 29, no. 10, pp. 2995–3013, 2019

  18. [18]

    Optimal leader-following consensus control of multi-agent systems: A neural network based graphical game approach,

    Y . Ren, Q. Wang, and Z. Duan, “Optimal leader-following consensus control of multi-agent systems: A neural network based graphical game approach,”IEEE Transactions on Network Science and Engineering, vol. 9, no. 5, pp. 3590–3601, 2022

  19. [19]

    A distributed minmax strategy for consensus tracking control in multiagent differential graphical games: A model-free approach,

    Y . Zhou, J. Zhou, G. Wen, M. Gan, and T. Yang, “A distributed minmax strategy for consensus tracking control in multiagent differential graphical games: A model-free approach,”IEEE Systems, Man, and Cybernetics Magazine, 2023

  20. [20]

    Game-theoretic event-triggered tracking control for scalable multi-agent systems,

    S. Zhu and F. Tan, “Game-theoretic event-triggered tracking control for scalable multi-agent systems,”European Journal of Control, p. 101401, 2025

  21. [21]

    Output-feedback Q-learning for discrete-time linearH ∞ tracking control: A stackelberg game ap- proach,

    Y . Ren, Q. Wang, and Z. Duan, “Output-feedback Q-learning for discrete-time linearH ∞ tracking control: A stackelberg game ap- proach,”International Journal of Robust and Nonlinear Control, vol. 32, no. 12, pp. 6805–6828, 2022

  22. [22]

    Distributed optimal control for linear multiagent systems on general digraphs,

    Z. Zhang, W. Yan, and H. Li, “Distributed optimal control for linear multiagent systems on general digraphs,”IEEE Transactions on Auto- matic Control, vol. 66, no. 1, pp. 322–328, 2021

  23. [23]

    Robust finite-time consensus tracking algorithm for multirobot systems,

    S. Khoo, L. Xie, and Z. Man, “Robust finite-time consensus tracking algorithm for multirobot systems,”IEEE/ASME Transactions on Mecha- tronics, vol. 14, no. 2, pp. 219–228, 2009

  24. [24]

    Distributed algorithm for the network size estimation: Blended dynamics approach,

    D. Lee, S. Lee, T. Kim, and H. Shim, “Distributed algorithm for the network size estimation: Blended dynamics approach,” in2018 IEEE Conference on Decision and Control (CDC), pp. 4577–4582, 2018

  25. [25]

    Distributed average tracking of multiple time-varying reference signals with bounded derivatives,

    F. Chen, Y . Cao, and W. Ren, “Distributed average tracking of multiple time-varying reference signals with bounded derivatives,”IEEE Trans- actions on Automatic Control, vol. 57, no. 12, pp. 3169–3174, 2012

  26. [26]

    Robust dynamic average consensus algorithms,

    J. George and R. A. Freeman, “Robust dynamic average consensus algorithms,”IEEE Transactions on Automatic Control, vol. 64, no. 11, pp. 4615–4622, 2019

  27. [27]

    Consensus problems in networks of agents with switching topology and time-delays,

    R. Olfati-Saber and R. Murray, “Consensus problems in networks of agents with switching topology and time-delays,”IEEE Transactions on Automatic Control, vol. 49, no. 9, pp. 1520–1533, 2004

  28. [28]

    Eigenvalues, diameter, and mean distance in graphs,

    B. Mohar, “Eigenvalues, diameter, and mean distance in graphs,”Graph. Comb., vol. 7, p. 53–64, mar 1991

  29. [29]

    Linear quadratic optimal consensus of discrete-time multi-agent systems with optimal steady state: A distributed model predictive control approach,

    Q. Wang, Z. Duan, Y . Lv, Q. Wang, and G. Chen, “Linear quadratic optimal consensus of discrete-time multi-agent systems with optimal steady state: A distributed model predictive control approach,”Auto- matica, vol. 127, p. 109505, 2021