Coalitional Zero-Sum Games for {H_(infty)} Leader-Following Consensus Control
Pith reviewed 2026-05-10 19:32 UTC · model grok-4.3
The pith
Formulating multi-agent leader-following consensus under adversarial attacks as a coalitional zero-sum game yields a distributed H∞ control law.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that the robust leader-following control problem under adversarial attacks can be solved by formulating it as a coalitional zero-sum differential game whose solution is an H∞ controller, and that the associated high-dimensional GARE can be decomposed into uniform lower-dimensional GAREs whose solutions are combined via a dynamic average consensus algorithm to obtain a fully distributed control law.
What carries the argument
The coalitional min-max zero-sum game whose value function satisfies the high-dimensional GARE; this GARE is decomposed into lower-dimensional copies whose coupling is resolved by dynamic average consensus.
If this is right
- The game-derived policy guarantees H∞ performance against disturbances modeled as attacks.
- Each agent computes its control input using only local state information after the decomposition and consensus steps.
- The method applies directly to formation control of multi-vehicle systems whose dynamics have been feedback-linearized.
- Distributed implementation removes the need for a central node to solve or store the full high-dimensional Riccati equation.
Where Pith is reading between the lines
- If the exact decomposition holds, the same game-theoretic reduction could be applied to other network disturbance-rejection tasks that currently require centralized Riccati solutions.
- The dynamic average consensus step may produce transient mismatch during finite-time convergence, so performance bounds would need explicit finite-time analysis.
- The linear-system assumption leaves open whether the decomposition and consensus technique extend to nonlinear agent dynamics without losing the H∞ guarantee.
Load-bearing premise
The high-dimensional GARE can be split into multiple uniform lower-dimensional GAREs while exactly preserving the optimality and robustness properties of the original centralized solution.
What would settle it
A side-by-side simulation on the same multi-vehicle formation where the distributed control inputs and closed-loop trajectories from the decomposed GAREs plus consensus differ measurably from those produced by solving the full centralized GARE.
Figures
read the original abstract
This paper investigates the leader-following consensus problem for a class of multi-agent systems subject to adversarial attack-like external inputs. To address this, we formulate the robust leader-following control problem as a global coalitional min-max zero-sum game using differential game theory. Specifically, the agents' control inputs form a coalition to minimize a global cost function, while the attacks form an opposing coalition to maximize it. Notably, when these external adversarial attacks manifest as disturbances, the designed game-theoretic control policy systematically yields a robust $H_\infty$ control law. Addressing this problem inherently requires solving a high-dimensional generalized algebraic Riccati equation (GARE), which poses significant challenges for distributed computation and controller implementation. To overcome these challenges, we propose a two-fold approach. First, a decentralized computational strategy is devised to decompose the high-dimensional GARE into multiple uniform, lower-dimensional GAREs. Second, a dynamic average consensus-based decoupling algorithm is developed to resolve the inherent coupling structure of the robust control law, thereby facilitating its distributed implementation. Finally, numerical simulations on the formation control of multi-vehicle systems with feedback-linearized dynamics are conducted to validate the effectiveness of the proposed algorithms.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper formulates the H∞ leader-following consensus problem for multi-agent systems under adversarial disturbances as a coalitional min-max zero-sum differential game. The resulting high-dimensional generalized algebraic Riccati equation (GARE) is decomposed into multiple uniform lower-dimensional GAREs, and a dynamic average consensus algorithm is used to decouple the control law for distributed implementation. Effectiveness is illustrated via numerical simulations on formation control of feedback-linearized multi-vehicle systems.
Significance. If the decomposition exactly preserves the saddle-point solution and the H∞ performance bound, the approach would offer a practical route to distributed robust control for MAS, mitigating the computational burden of global GAREs while retaining theoretical guarantees. The simulation results on multi-vehicle formation provide concrete evidence of applicability, but the strength hinges on rigorous verification that the distributed law matches the centralized H∞ optimum.
major comments (2)
- [the proposed decentralized computational strategy] The central claim that the high-dimensional GARE decomposes into uniform lower-dimensional GAREs while preserving optimality and the H∞ bound (two-fold approach) requires explicit proof. For leader-following MAS the closed-loop dynamics involve the graph Laplacian; uniformity of the decomposed Riccati solutions holds only under restrictive assumptions (identical agents, regular undirected graphs). Without a general proof that the reassembled solution satisfies the original saddle-point condition, the distributed policy may lose the guaranteed disturbance attenuation level.
- [the dynamic average consensus-based decoupling algorithm] The dynamic average consensus decoupling step must be shown not to perturb the control input away from the exact H∞ saddle point. Any finite-time estimation error or communication delay introduces a mismatch that can violate the min-max optimality and degrade the closed-loop H∞ norm; the manuscript should quantify the resulting performance loss or provide a stability margin.
minor comments (2)
- Notation for the coalitional cost function and the disturbance attenuation level γ should be introduced with explicit definitions before the GARE is stated.
- The simulation section would benefit from a direct comparison of the achieved H∞ norm against the centralized solution and against a non-decomposed baseline to quantify any performance gap.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which help strengthen the theoretical foundations of our work. We address each major comment point by point below and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: The central claim that the high-dimensional GARE decomposes into uniform lower-dimensional GAREs while preserving optimality and the H∞ bound (two-fold approach) requires explicit proof. For leader-following MAS the closed-loop dynamics involve the graph Laplacian; uniformity of the decomposed Riccati solutions holds only under restrictive assumptions (identical agents, regular undirected graphs). Without a general proof that the reassembled solution satisfies the original saddle-point condition, the distributed policy may lose the guaranteed disturbance attenuation level.
Authors: We agree that an explicit proof is required to rigorously establish preservation of the saddle-point solution and the H∞ performance bound. The manuscript develops the decomposition specifically for identical agents over undirected regular graphs, where the Laplacian structure enables uniform lower-dimensional GAREs. In the revised version we will insert a new theorem together with a complete proof demonstrating that the reassembled solution satisfies the original min-max saddle-point condition under these assumptions, thereby retaining the guaranteed disturbance attenuation level. We will also clarify the assumptions and note that uniformity need not hold for non-identical agents or directed graphs. revision: yes
-
Referee: The dynamic average consensus decoupling step must be shown not to perturb the control input away from the exact H∞ saddle point. Any finite-time estimation error or communication delay introduces a mismatch that can violate the min-max optimality and degrade the closed-loop H∞ norm; the manuscript should quantify the resulting performance loss or provide a stability margin.
Authors: The dynamic average consensus protocol converges asymptotically to the exact average, so the distributed control law converges to the centralized H∞ saddle-point solution. To address finite-time estimation errors and bounded communication delays, the revised manuscript will include a Lyapunov-based analysis that quantifies the resulting deviation from the optimal H∞ norm and supplies an explicit stability margin together with a bound on performance degradation. revision: yes
Circularity Check
No circularity: standard differential game theory applied to H∞ formulation; decomposition and consensus steps are constructive proposals, not reductions to inputs or self-citations
full rationale
The derivation begins with a standard formulation of the leader-following consensus problem as a coalitional min-max zero-sum game drawn from differential game theory, which directly yields the H∞ control law via the global GARE as a known consequence of the LQ setup. The subsequent two-fold approach (decomposition of the GARE into uniform lower-dimensional instances and dynamic-average-consensus decoupling) consists of explicitly constructed algorithms whose correctness is asserted via derivation rather than by redefining the target quantity in terms of itself or by load-bearing self-citation. No equation is shown to equal its own fitted parameter or prior result by construction, and the paper does not invoke uniqueness theorems or ansatzes from the authors' own prior work to force the outcome. The chain therefore remains self-contained against external benchmarks of game-theoretic H∞ control.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The multi-agent system dynamics are linear and the performance index is quadratic.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
formulate the robust leader-following control problem as a global coalitional min-max zero-sum game... high-dimensional generalized algebraic Riccati equation (GARE)... decompose the high-dimensional GARE into multiple uniform, lower-dimensional GAREs
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
quadratic performance function J(δ,u,w)=1/2∫(δᵀQδ+uᵀRu−wᵀΓw)dt
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Consensus building in multi-vehicle systems with information feedback,
W. Ren, “Consensus building in multi-vehicle systems with information feedback,” in2006 International Conference on Mechatronics and Automation, pp. 37–42, 2006
work page 2006
-
[2]
D. Liang, Y . Yang, R. Li, and R. Liu, “Finite-frequencyH −/H∞ unknown input observer-based distributed fault detection for multi-agent systems,”Journal of the Franklin Institute, vol. 358, no. 6, pp. 3258– 3275, 2021
work page 2021
-
[3]
Distributed kalman filter with ultimately accurate fused measurement covariance,
T. Yang, J. Qian, Z. Duan, and Z. Sun, “Distributed kalman filter with ultimately accurate fused measurement covariance,”IEEE Transactions on Automatic Control, pp. 1–16, 2025
work page 2025
-
[4]
Consensus of multiagent systems and synchronization of complex networks: A unified viewpoint,
Z. Li, Z. Duan, G. Chen, and L. Huang, “Consensus of multiagent systems and synchronization of complex networks: A unified viewpoint,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 57, no. 1, pp. 213–224, 2009
work page 2009
-
[5]
G. Wen, W. Yu, G. Hu, J. Cao, and X. Yu, “Pinning synchronization of directed networks with switching topologies: A multiple lyapunov func- tions approach,”IEEE Transactions on Neural Networks and Learning Systems, vol. 26, no. 12, pp. 3239–3250, 2015
work page 2015
-
[6]
P. Yu, Y . Hu, Y . Wang, R. Jia, and J. Guo, “Optimal consensus control strategy for multi-agent systems under cyber attacks via a stackelberg game approach,”IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 18875–18888, 2025
work page 2025
-
[7]
R. Jia, T. Wang, W. Xue, J. Guo, and Y . Zhao, “Multitime scale consensus algorithm of multiagent systems with binary-valued data under tampering attacks,”IEEE Transactions on Industrial Informatics, vol. 21, no. 12, pp. 9377–9388, 2025
work page 2025
-
[8]
T. Bas ¸ar and P. Bernhard,H-infinity optimal control and related minimax design problems: a dynamic game approach. Springer Science & Business Media, 2008
work page 2008
-
[9]
K. Zhou, J. C. Doyle, K. Glover,et al.,Robust and optimal control, vol. 40. Prentice hall New Jersey, 1996
work page 1996
-
[10]
H ∞ control of networked multi-agent systems,
Z. Li, Z. Duan, and L. Huang, “H ∞ control of networked multi-agent systems,”Journal of Systems Science and Complexity, vol. 22, p. 35–48, 2009
work page 2009
-
[11]
RobustH ∞ consensus control of uncertain multi- agent systems with time delays,
Y . Liu and Y . Jia, “RobustH ∞ consensus control of uncertain multi- agent systems with time delays,”International Journal of Control, Automation and Systems, vol. 9, no. 6, pp. 1086–1094, 2011
work page 2011
-
[12]
Y . Liu and Y . Jia, “H∞ consensus control of multi-agent systems with switching topology: a dynamic output feedback protocol,”International Journal of Control, vol. 83, no. 3, pp. 527–537, 2010
work page 2010
-
[13]
G. Wen, Z. Duan, Z. Li, and G. Chen, “Consensus and itsL 2- gain performance of multi-agent systems with intermittent information transmissions,”International Journal of Control, vol. 85, no. 4, pp. 384– 396, 2012
work page 2012
-
[14]
DistributedH ∞ andH 2 consen- sus control in directed networks,
J. Wang, Z. Duan, Z. Li, and G. Wen, “DistributedH ∞ andH 2 consen- sus control in directed networks,”IET Control Theory & Applications, vol. 8, pp. 193–201(8), February 2014
work page 2014
-
[15]
K. G. Vamvoudakis, F. L. Lewis, and G. R. Hudas, “Multi-agent differential graphical games: Online adaptive learning solution for syn- chronization with optimality,”Automatica, vol. 48, no. 8, pp. 1598–1611, 2012
work page 2012
-
[16]
Multi- agent zero-sum differential graphical games for disturbance rejection in distributed control,
Q. Jiao, H. Modares, S. Xu, F. L. Lewis, and K. G. Vamvoudakis, “Multi- agent zero-sum differential graphical games for disturbance rejection in distributed control,”Automatica, vol. 69, pp. 24–34, 2016
work page 2016
-
[17]
Differential graphical games forH ∞ control of linear heterogeneous multiagent systems,
F. Adib Yaghmaie, K. Hengster Movric, F. L. Lewis, and R. Su, “Differential graphical games forH ∞ control of linear heterogeneous multiagent systems,”International Journal of Robust and Nonlinear Control, vol. 29, no. 10, pp. 2995–3013, 2019
work page 2019
-
[18]
Y . Ren, Q. Wang, and Z. Duan, “Optimal leader-following consensus control of multi-agent systems: A neural network based graphical game approach,”IEEE Transactions on Network Science and Engineering, vol. 9, no. 5, pp. 3590–3601, 2022
work page 2022
-
[19]
Y . Zhou, J. Zhou, G. Wen, M. Gan, and T. Yang, “A distributed minmax strategy for consensus tracking control in multiagent differential graphical games: A model-free approach,”IEEE Systems, Man, and Cybernetics Magazine, 2023
work page 2023
-
[20]
Game-theoretic event-triggered tracking control for scalable multi-agent systems,
S. Zhu and F. Tan, “Game-theoretic event-triggered tracking control for scalable multi-agent systems,”European Journal of Control, p. 101401, 2025
work page 2025
-
[21]
Y . Ren, Q. Wang, and Z. Duan, “Output-feedback Q-learning for discrete-time linearH ∞ tracking control: A stackelberg game ap- proach,”International Journal of Robust and Nonlinear Control, vol. 32, no. 12, pp. 6805–6828, 2022
work page 2022
-
[22]
Distributed optimal control for linear multiagent systems on general digraphs,
Z. Zhang, W. Yan, and H. Li, “Distributed optimal control for linear multiagent systems on general digraphs,”IEEE Transactions on Auto- matic Control, vol. 66, no. 1, pp. 322–328, 2021
work page 2021
-
[23]
Robust finite-time consensus tracking algorithm for multirobot systems,
S. Khoo, L. Xie, and Z. Man, “Robust finite-time consensus tracking algorithm for multirobot systems,”IEEE/ASME Transactions on Mecha- tronics, vol. 14, no. 2, pp. 219–228, 2009
work page 2009
-
[24]
Distributed algorithm for the network size estimation: Blended dynamics approach,
D. Lee, S. Lee, T. Kim, and H. Shim, “Distributed algorithm for the network size estimation: Blended dynamics approach,” in2018 IEEE Conference on Decision and Control (CDC), pp. 4577–4582, 2018
work page 2018
-
[25]
Distributed average tracking of multiple time-varying reference signals with bounded derivatives,
F. Chen, Y . Cao, and W. Ren, “Distributed average tracking of multiple time-varying reference signals with bounded derivatives,”IEEE Trans- actions on Automatic Control, vol. 57, no. 12, pp. 3169–3174, 2012
work page 2012
-
[26]
Robust dynamic average consensus algorithms,
J. George and R. A. Freeman, “Robust dynamic average consensus algorithms,”IEEE Transactions on Automatic Control, vol. 64, no. 11, pp. 4615–4622, 2019
work page 2019
-
[27]
Consensus problems in networks of agents with switching topology and time-delays,
R. Olfati-Saber and R. Murray, “Consensus problems in networks of agents with switching topology and time-delays,”IEEE Transactions on Automatic Control, vol. 49, no. 9, pp. 1520–1533, 2004
work page 2004
-
[28]
Eigenvalues, diameter, and mean distance in graphs,
B. Mohar, “Eigenvalues, diameter, and mean distance in graphs,”Graph. Comb., vol. 7, p. 53–64, mar 1991
work page 1991
-
[29]
Q. Wang, Z. Duan, Y . Lv, Q. Wang, and G. Chen, “Linear quadratic optimal consensus of discrete-time multi-agent systems with optimal steady state: A distributed model predictive control approach,”Auto- matica, vol. 127, p. 109505, 2021
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.