Model-Free Aggregative Cooperative Optimization via Randomized Gradient-Free Minimization and Exploration Momentum

Amir Mehrnoosh; Gianluca Bianchin; Giuseppe Notarstefano; Giuseppe Speciale; Riccardo Brumali

arxiv: 2604.07164 · v1 · submitted 2026-04-08 · 🧮 math.OC

Model-Free Aggregative Cooperative Optimization via Randomized Gradient-Free Minimization and Exploration Momentum

Amir Mehrnoosh , Giuseppe Speciale , Riccardo Brumali , Giuseppe Notarstefano , Gianluca Bianchin This is my paper

Pith reviewed 2026-05-10 17:20 UTC · model grok-4.3

classification 🧮 math.OC

keywords aggregative optimizationgradient-free methodsdistributed optimizationcooperative optimizationrandomized algorithmsmomentum methodsmodel-free optimization

0 comments

The pith

A randomized gradient-free algorithm solves aggregative cooperative optimization without true gradients.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Aggregative cooperative optimization requires agents to minimize objectives that depend on both local decisions and a shared aggregate variable representing global behavior. The paper introduces ARGFree, which performs this task using only function evaluations by combining randomized finite-difference gradient approximations with distributed tracking variables that emulate gradient-based updates. It establishes convergence in expectation to an approximate optimizer, where the gap arises solely from the estimator's randomness. A variant called ARGFree-EM adds momentum to the exploration signals to reduce fluctuations and improve tracking accuracy in high dimensions.

Core claim

ARGFree is the first method capable of solving aggregative cooperative optimization problems without gradient information, by pairing randomized finite-difference approximations with a set of tracking variables that replicate the dynamics of gradient-based distributed algorithms, achieving convergence in expectation to an approximate optimizer; the ARGFree-EM extension incorporates momentum in the exploration signals to smooth fluctuations and thereby tighten the distributed tracking mechanism.

What carries the argument

ARGFree algorithm that emulates gradient descent via randomized finite-difference gradient approximations combined with distributed tracking variables, plus the momentum-enhanced ARGFree-EM variant.

If this is right

Agents can reach approximate solutions to aggregative problems using only local function evaluations and neighbor communication.
The approximation error is bounded and traceable directly to the finite-difference estimator rather than to communication or distribution.
Momentum in exploration signals reduces variance in the tracking variables and improves solution quality as dimension grows.
The framework applies to any setting where true gradients are unavailable or costly, such as black-box or sensor-driven systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same finite-difference-plus-tracking structure could be tested in non-aggregative distributed problems where only partial gradients are missing.
In practice the method suggests a tunable knob between exploration noise level and final accuracy that could be calibrated on real multi-agent hardware.
If the momentum term generalizes, similar acceleration might reduce communication rounds needed for convergence in other gradient-free distributed schemes.

Load-bearing premise

Finite-difference approximations and tracking variables can reliably stand in for unavailable true gradients in distributed aggregative problems.

What would settle it

Implement ARGFree on a low-dimensional aggregative benchmark whose optimum is known exactly, then check whether the achieved error remains inside the bound predicted by the randomized estimator or grows without bound.

Figures

Figures reproduced from arXiv: 2604.07164 by Amir Mehrnoosh, Gianluca Bianchin, Giuseppe Notarstefano, Giuseppe Speciale, Riccardo Brumali.

**Figure 2.** Figure 2: Illustrative applications of the aggregative cooperative optimization framework ( [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗

**Figure 3.** Figure 3: (Left) Robot trajectories under the algorithm of [7], which relies on exact gradient information. (Center) Trajectories obtained with ARGFree (Algorithm 1). (Right) Trajectories obtained with ARGFree-EM (Algorithm 2). Both proposed methods achieve performance comparable to the gradientbased approach while requiring no gradient information. Qualitatively, ARGFree-EM produces smoother trajectories that more… view at source ↗

**Figure 4.** Figure 4: Evolution of the relative loss. (a) Noiseless scenario. (b) Noisy [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗

**Figure 5.** Figure 5: Evolution of the gradient norm ∥∇f(xk)∥ evaluated at the current iterates. (a) Noiseless scenario. (b) Noisy scenario. These results corroborate the relative loss findings, demonstrating the robustness of direct function evaluations against gradient-amplified noise. son, our algorithm relies solely on direct evaluations of the objective function (i.e., measurements of position-dependent signals), which ten… view at source ↗

read the original abstract

Aggregative cooperative optimization problems arise in distributed decision-making settings where each agent's objective depends on its own decision as well as on an aggregate variable capturing global system behavior. Motivated by practical scenarios where gradient information is unavailable, this paper introduces a randomized gradient-free algorithm, named ARGFree, for solving such problems. ARGFree combines finite-difference gradient approximations with a set of tracking variables, emulating the behavior of a gradient-based method. We prove that ARGFree converges in expectation to an approximate optimizer, with the approximation error stemming from the use of a randomized gradient estimator. To enhance performance in high-dimensional settings, we further propose an improved variant, ARGFree-EM, which incorporates momentum in the exploration signals to smooth sudden fluctuations in the gradient exploration signals and thereby improve the accuracy of the underlying distributed tracking mechanism. To the best of our knowledge, the class of ARGFree methods is the first in the literature capable of solving aggregating cooperative optimization problems without gradient information.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper gives the first gradient-free method for aggregative cooperative optimization with an expectation convergence proof and a momentum variant to reduce variance.

read the letter

The core contribution is a randomized gradient-free algorithm, ARGFree, for aggregative problems where each agent's cost depends on its local decision and a global aggregate. It uses finite-difference estimates plus tracking variables to mimic gradient tracking without needing actual gradients, and they prove convergence in expectation to a neighborhood whose size is set by the estimator bias. The ARGFree-EM version adds momentum to the exploration signals to damp fluctuations in high dimensions, which improves the tracking accuracy without changing the asymptotic bias term.

Referee Report

0 major / 3 minor

Summary. The manuscript introduces ARGFree, a randomized gradient-free algorithm for aggregative cooperative optimization problems. It uses finite-difference estimators combined with distributed tracking variables to emulate gradient-based behavior and proves convergence in expectation to an approximate optimizer, with the error arising from the randomized estimator. An enhanced variant ARGFree-EM adds momentum to the exploration signals to reduce variance and improve tracking accuracy in high dimensions. The authors position the method as the first in the literature capable of solving such problems without gradient information, under standard assumptions including Lipschitz continuity and bounded estimator variance.

Significance. If the stated convergence guarantees hold, the work fills a gap in model-free distributed optimization for aggregative problems common in multi-agent systems. The explicit error analysis separating bias from the one-point/two-point estimator and the demonstration that momentum reduces variance without changing the asymptotic bias term are useful technical contributions. The approach could enable practical deployment in black-box settings such as resource allocation or sensor networks.

minor comments (3)

[§3.2] §3.2, Algorithm 1: the update for the tracking variable x_i^{k+1} is written with a step-size α_k that is not explicitly linked to the conditions in Theorem 1; adding a cross-reference would clarify the parameter schedule.
[§4.3] §4.3, Eq. (18): the variance bound for the momentum-augmented estimator is stated as O(δ^2 + σ^2 / m), but the dependence on the momentum parameter β is only implicit; an explicit expression would strengthen the comparison to the non-momentum case.
[Figure 2] Figure 2: the caption does not indicate the dimension d or the number of agents N used in the numerical example, making it difficult to assess scalability claims.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our manuscript, the recognition that ARGFree is the first gradient-free method for aggregative cooperative optimization, and the recommendation for minor revision. We appreciate the comments on the significance of the explicit error analysis and the momentum variant.

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper introduces the ARGFree algorithm combining finite-difference gradient estimators with distributed tracking variables for aggregative problems, then derives an expectation convergence result to an approximate optimizer. The analysis explicitly invokes standard assumptions (Lipschitz continuity of gradients, bounded variance of the randomized estimator, suitable step-size sequences) to bound tracking error and estimator bias. No step reduces by construction to a fitted parameter, self-referential definition, or load-bearing self-citation chain; the convergence bound is obtained from the algorithm dynamics and stated hypotheses rather than from renaming or smuggling prior results. The momentum variant ARGFree-EM is analyzed as a variance-reduction extension without altering the asymptotic bias term. The derivation is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based on abstract only; no explicit free parameters, invented entities, or non-standard axioms are stated. The convergence result implicitly rests on standard assumptions for stochastic approximation and finite-difference estimators in distributed settings.

axioms (1)

domain assumption The underlying optimization problem admits convergence of stochastic gradient-free methods with tracking variables
Invoked to support the stated convergence-in-expectation result.

pith-pipeline@v0.9.0 · 5478 in / 1219 out tokens · 34515 ms · 2026-05-10T17:20:54.364032+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

[1]

Distributed optimization for control,

A. Nedi ´c and J. Liu, “Distributed optimization for control,”Annual Review of Control, Robotics, and Autonomous Systems, vol. 1, pp. 77– 103, 2018. doi: 10.1146/annurev-control-060117-105131

work page doi:10.1146/annurev-control-060117-105131 2018
[2]

Johansson

T. Yang, X. Yi, J. Wu, Y . Yuan, D. Wu, Z. Meng, Y . Hong, H. Wang, Z. Lin, and K. H. Johansson, “A survey of distributed optimiza- tion,”Annual Reviews in Control, vol. 47, pp. 278–305, 2019. doi: 10.1016/j.arcontrol.2019.05.006

work page doi:10.1016/j.arcontrol.2019.05.006 2019
[3]

Distributed opti- mization for smart cyber-physical networks,

G. Notarstefano, I. Notarnicola, and A. Camisa, “Distributed opti- mization for smart cyber-physical networks,”Foundations and Trends in Systems and Control, vol. 7, no. 3, pp. 253–383, 2019. doi: 10.1561/9781680836196

work page doi:10.1561/9781680836196 2019
[4]

Optimization of linear multi-agent dynamical systems via feedback distributed gradient descent methods,

A. Mehrnoosh and G. Bianchin, “Optimization of linear multi-agent dynamical systems via feedback distributed gradient descent methods,” in2025 American Control Conference (ACC). IEEE, 2025, pp. 4579– 4584

work page 2025
[5]

Nonconvex distributed feedback optimization for aggregative cooperative robotics,

G. Carnevale, N. Mimmo, and G. Notarstefano, “Nonconvex distributed feedback optimization for aggregative cooperative robotics,”Automatica, vol. 167, p. 111767, 2024. doi: 10.1016/j.automatica.2024.111767

work page doi:10.1016/j.automatica.2024.111767 2024
[6]

k-dimensional agreement in multiagent systems,

G. Bianchin, M. Vaquero, J. Cort ´es, and E. Dall’Anese, “k-dimensional agreement in multiagent systems,”IEEE Transactions on Auto- matic Control, vol. 69, no. 12, pp. 8978–08 985, Dec. 2024. doi: 10.1109/TAC.2024.3431108

work page doi:10.1109/tac.2024.3431108 2024
[7]

Distributed aggregative optimization over multi-agent networks,

X. Li, L. Xie, and Y . Hong, “Distributed aggregative optimization over multi-agent networks,”IEEE Transactions on Automatic Control, vol. 67, no. 6, pp. 3165–3171, 2022. doi: 10.1109/tac.2021.3095456

work page doi:10.1109/tac.2021.3095456 2022
[8]

Distributed Nash equilibrium seeking for aggregative games with coupled constraints,

S. Liang, P. Yi, and Y . Hong, “Distributed Nash equilibrium seeking for aggregative games with coupled constraints,”Automatica, vol. 85, pp. 179–185, 2017. doi: 10.1016/j.automatica.2017.07.064

work page doi:10.1016/j.automatica.2017.07.064 2017
[9]

Bianchin, J

G. Bianchin, J. Cort ´es, J. I. Poveda, and E. Dall’Anese, “Time-varying optimization of LTI systems via projected primal-dual gradient flows,” IEEE Transactions on Control of Network Systems, vol. 9, no. 1, pp. 474–486, Mar. 2022. doi: 10.1109/TCNS.2021.3112762

work page doi:10.1109/tcns.2021.3112762 2022
[10]

Distributed online convex optimization with an aggregative variable,

X. Li, X. Yi, and L. Xie, “Distributed online convex optimization with an aggregative variable,”IEEE Transactions on Control of Network Sys- tems, vol. 9, no. 1, pp. 438–449, 2021. doi: 10.1109/tcns.2021.3107480

work page doi:10.1109/tcns.2021.3107480 2021
[11]

Distributed projection-free algorithm for constrained aggregative optimization,

T. Wang and P. Yi, “Distributed projection-free algorithm for constrained aggregative optimization,”International Journal of Robust and Nonlin- ear Control, vol. 33, no. 10, pp. 5273–5288, 2023. doi: 10.1002/rnc.6640

work page doi:10.1002/rnc.6640 2023
[12]

Achieving linear con- vergence in distributed aggregative optimization over directed graphs,

L. Chen, G. Wen, X. Fang, J. Zhou, and J. Cao, “Achieving linear con- vergence in distributed aggregative optimization over directed graphs,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 54, no. 7, pp. 4529–4541, 2024. doi: 10.1109/tsmc.2024.3382173

work page doi:10.1109/tsmc.2024.3382173 2024
[13]

In: 2022 IEEE 61st Conference on Decision and Control (CDC)

G. Carnevale and G. Notarstefano, “A learning-based distributed algo- rithm for personalized aggregative optimization,” inIEEE Conf. on De- cision and Control, 2022, pp. 1576–1581. doi: cdc51059.2022.9992678

work page arXiv 2022
[14]

Data-driven distributed optimization via aggregative tracking and deep-learning,

R. Brumali, G. Carnevale, and G. Notarstefano, “Data-driven distributed optimization via aggregative tracking and deep-learning,”arXiv preprint, 2025, arXiv:2503.04668

work page arXiv 2025
[15]

Random gradient-free minimization of convex functions

Y . Nesterov and V . Spokoiny, “Random gradient-free minimization of convex functions,”Foundations of Computational Mathematics, vol. 17, pp. 527–566, 2017. doi: 10.1007/s10208-015-9296-2

work page doi:10.1007/s10208-015-9296-2 2017
[16]

Distributed zero-order algorithms for nonconvex multiagent optimization,

Y . Tang, J. Zhang, and N. Li, “Distributed zero-order algorithms for nonconvex multiagent optimization,”IEEE Transactions on Con- trol of Network Systems, vol. 8, no. 1, pp. 269–281, 2020. doi: 10.1109/tcns.2020.3024321

work page doi:10.1109/tcns.2020.3024321 2020
[17]

ZONE: Zeroth-order non- convex multiagent optimization over networks,

D. Hajinezhad, M. Hong, and A. Garcia, “ZONE: Zeroth-order non- convex multiagent optimization over networks,”IEEE Transactions on Automatic Control, vol. 64, no. 10, pp. 3995–4010, 2019. doi: 10.1109/TAC.2019.2896025

work page doi:10.1109/tac.2019.2896025 2019
[18]

Single point-based distributed zeroth- order optimization with a non-convex stochastic objective function,

E. Mhanna and M. Assaad, “Single point-based distributed zeroth- order optimization with a non-convex stochastic objective function,” in International Conference on Machine Learning. PMLR, 2023, pp. 24 701–24 719

work page 2023
[19]

Zero-gradient-sum algorithms for distributed convex optimization: The continuous-time case,

J. Lu and C. Y . Tang, “Zero-gradient-sum algorithms for distributed convex optimization: The continuous-time case,”IEEE Transactions on Automatic Control, vol. 57, no. 9, pp. 2348–2354, 2012. doi: 10.1109/acc.2011.5991466

work page doi:10.1109/acc.2011.5991466 2012
[20]

Randomized gradient-free distributed optimization methods for a multiagent system with unknown cost function,

Y . Pang and G. Hu, “Randomized gradient-free distributed optimization methods for a multiagent system with unknown cost function,”IEEE Transactions on Automatic Control, vol. 65, no. 1, pp. 333–340, 2019. doi: 10.1109/tac.2019.2914025

work page doi:10.1109/tac.2019.2914025 2019
[21]

Randomized gradient-free method for multi- agent optimization over time-varying networks,

D. Yuan and D. W. Ho, “Randomized gradient-free method for multi- agent optimization over time-varying networks,”IEEE transactions on neural networks and learning systems, vol. 26, no. 6, pp. 1342–1347,

work page
[22]

doi: 10.1109/tnnls.2014.2336806

work page doi:10.1109/tnnls.2014.2336806 2014
[23]

Decentralized zeroth-order constrained stochas- tic optimization algorithms: Frank–Wolfe and variants with applications to black-box adversarial attacks,

A. K. Sahu and S. Kar, “Decentralized zeroth-order constrained stochas- tic optimization algorithms: Frank–Wolfe and variants with applications to black-box adversarial attacks,”Proceedings of the IEEE, vol. 108, no. 11, pp. 1890–1905, 2020. doi: 10.1109/jproc.2020.3012609

work page doi:10.1109/jproc.2020.3012609 1905
[24]

Random gradient-free optimization for multiagent systems with communication noises under a time-varying weight balanced digraph,

D. Wang, J. Zhou, Z. Wang, and W. Wang, “Random gradient-free optimization for multiagent systems with communication noises under a time-varying weight balanced digraph,”IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 1, pp. 281–289, 2017. doi: 10.1109/tsmc.2017.2757265

work page doi:10.1109/tsmc.2017.2757265 2017
[25]

Fast optimization with zeroth-order feedback in distributed, multi-user MIMO systems,

O. Bilenne, P. Mertikopoulos, and E. V . Belmega, “Fast optimization with zeroth-order feedback in distributed, multi-user MIMO systems,” IEEE Transactions on Signal Processing, vol. 68, pp. 6085–6100, 2020. doi: 10.1109/tsp.2020.3029983

work page doi:10.1109/tsp.2020.3029983 2020
[26]

Ex- tremum seeking tracking for derivative-free distributed optimiza- tion,

N. Mimmo, G. Carnevale, A. Testa, and G. Notarstefano, “Ex- tremum seeking tracking for derivative-free distributed optimiza- tion,”IEEE Transactions on Control of Network Systems, 2024. doi: 10.1109/TCNS.2024.3510368 (Early access)

work page doi:10.1109/tcns.2024.3510368 2024
[27]

Zeroth-order learning in continuous games via residual pseudogradient estimates,

Y . Huang and J. Hu, “Zeroth-order learning in continuous games via residual pseudogradient estimates,”IEEE Transactions on Automatic Control, 2024. doi: 10.1109/tac.2024.3479874 (Early access)

work page doi:10.1109/tac.2024.3479874 2024
[28]

Convergence in multiagent coordination, consensus, and flocking,

V . D. Blondel, J. M. Hendrickx, A. Olshevsky, and J. N. Tsitsiklis, “Convergence in multiagent coordination, consensus, and flocking,” inIEEE Conf. on Decision and Control, 2005, pp. 2996–3000. doi: 10.1109/cdc.2005.1582620

work page doi:10.1109/cdc.2005.1582620 2005
[29]

On the linear quadratic data-driven control , year =

R. Olfati-Saber and R. M. Murray, “Consensus problems in networks of agents with switching topology and time-delays,”IEEE Transactions on Automatic Control, vol. 49, no. 9, pp. 1520–1533, 2004. doi: 10.23919/ecc.2007.7068297

work page doi:10.23919/ecc.2007.7068297 2004
[30]

A survey of consensus problems in multi-agent coordination,

W. Ren, R. W. Beard, and E. M. Atkins, “A survey of consensus problems in multi-agent coordination,” inAmerican Control Conference, Portland, OR, Jun. 2005, pp. 1859–1864. doi: 10.1109/acc.2005.1470239

work page doi:10.1109/acc.2005.1470239 2005
[31]

Distributed strategies for generating weight-balanced and doubly stochastic digraphs,

B. Gharesifard and J. Cort ´es, “Distributed strategies for generating weight-balanced and doubly stochastic digraphs,”European Journal of Control, vol. 18, no. 6, pp. 539–557, 2012. doi: 10.3166/ejc.18.539-557

work page doi:10.3166/ejc.18.539-557 2012
[32]

R. A. Horn and C. R. Johnson,Matrix Analysis. Cambridge University Press, 1985. ISBN 0521386322 14

work page 1985

[1] [1]

Distributed optimization for control,

A. Nedi ´c and J. Liu, “Distributed optimization for control,”Annual Review of Control, Robotics, and Autonomous Systems, vol. 1, pp. 77– 103, 2018. doi: 10.1146/annurev-control-060117-105131

work page doi:10.1146/annurev-control-060117-105131 2018

[2] [2]

Johansson

T. Yang, X. Yi, J. Wu, Y . Yuan, D. Wu, Z. Meng, Y . Hong, H. Wang, Z. Lin, and K. H. Johansson, “A survey of distributed optimiza- tion,”Annual Reviews in Control, vol. 47, pp. 278–305, 2019. doi: 10.1016/j.arcontrol.2019.05.006

work page doi:10.1016/j.arcontrol.2019.05.006 2019

[3] [3]

Distributed opti- mization for smart cyber-physical networks,

G. Notarstefano, I. Notarnicola, and A. Camisa, “Distributed opti- mization for smart cyber-physical networks,”Foundations and Trends in Systems and Control, vol. 7, no. 3, pp. 253–383, 2019. doi: 10.1561/9781680836196

work page doi:10.1561/9781680836196 2019

[4] [4]

Optimization of linear multi-agent dynamical systems via feedback distributed gradient descent methods,

A. Mehrnoosh and G. Bianchin, “Optimization of linear multi-agent dynamical systems via feedback distributed gradient descent methods,” in2025 American Control Conference (ACC). IEEE, 2025, pp. 4579– 4584

work page 2025

[5] [5]

Nonconvex distributed feedback optimization for aggregative cooperative robotics,

G. Carnevale, N. Mimmo, and G. Notarstefano, “Nonconvex distributed feedback optimization for aggregative cooperative robotics,”Automatica, vol. 167, p. 111767, 2024. doi: 10.1016/j.automatica.2024.111767

work page doi:10.1016/j.automatica.2024.111767 2024

[6] [6]

k-dimensional agreement in multiagent systems,

G. Bianchin, M. Vaquero, J. Cort ´es, and E. Dall’Anese, “k-dimensional agreement in multiagent systems,”IEEE Transactions on Auto- matic Control, vol. 69, no. 12, pp. 8978–08 985, Dec. 2024. doi: 10.1109/TAC.2024.3431108

work page doi:10.1109/tac.2024.3431108 2024

[7] [7]

Distributed aggregative optimization over multi-agent networks,

X. Li, L. Xie, and Y . Hong, “Distributed aggregative optimization over multi-agent networks,”IEEE Transactions on Automatic Control, vol. 67, no. 6, pp. 3165–3171, 2022. doi: 10.1109/tac.2021.3095456

work page doi:10.1109/tac.2021.3095456 2022

[8] [8]

Distributed Nash equilibrium seeking for aggregative games with coupled constraints,

S. Liang, P. Yi, and Y . Hong, “Distributed Nash equilibrium seeking for aggregative games with coupled constraints,”Automatica, vol. 85, pp. 179–185, 2017. doi: 10.1016/j.automatica.2017.07.064

work page doi:10.1016/j.automatica.2017.07.064 2017

[9] [9]

Bianchin, J

G. Bianchin, J. Cort ´es, J. I. Poveda, and E. Dall’Anese, “Time-varying optimization of LTI systems via projected primal-dual gradient flows,” IEEE Transactions on Control of Network Systems, vol. 9, no. 1, pp. 474–486, Mar. 2022. doi: 10.1109/TCNS.2021.3112762

work page doi:10.1109/tcns.2021.3112762 2022

[10] [10]

Distributed online convex optimization with an aggregative variable,

X. Li, X. Yi, and L. Xie, “Distributed online convex optimization with an aggregative variable,”IEEE Transactions on Control of Network Sys- tems, vol. 9, no. 1, pp. 438–449, 2021. doi: 10.1109/tcns.2021.3107480

work page doi:10.1109/tcns.2021.3107480 2021

[11] [11]

Distributed projection-free algorithm for constrained aggregative optimization,

T. Wang and P. Yi, “Distributed projection-free algorithm for constrained aggregative optimization,”International Journal of Robust and Nonlin- ear Control, vol. 33, no. 10, pp. 5273–5288, 2023. doi: 10.1002/rnc.6640

work page doi:10.1002/rnc.6640 2023

[12] [12]

Achieving linear con- vergence in distributed aggregative optimization over directed graphs,

L. Chen, G. Wen, X. Fang, J. Zhou, and J. Cao, “Achieving linear con- vergence in distributed aggregative optimization over directed graphs,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 54, no. 7, pp. 4529–4541, 2024. doi: 10.1109/tsmc.2024.3382173

work page doi:10.1109/tsmc.2024.3382173 2024

[13] [13]

In: 2022 IEEE 61st Conference on Decision and Control (CDC)

G. Carnevale and G. Notarstefano, “A learning-based distributed algo- rithm for personalized aggregative optimization,” inIEEE Conf. on De- cision and Control, 2022, pp. 1576–1581. doi: cdc51059.2022.9992678

work page arXiv 2022

[14] [14]

Data-driven distributed optimization via aggregative tracking and deep-learning,

R. Brumali, G. Carnevale, and G. Notarstefano, “Data-driven distributed optimization via aggregative tracking and deep-learning,”arXiv preprint, 2025, arXiv:2503.04668

work page arXiv 2025

[15] [15]

Random gradient-free minimization of convex functions

Y . Nesterov and V . Spokoiny, “Random gradient-free minimization of convex functions,”Foundations of Computational Mathematics, vol. 17, pp. 527–566, 2017. doi: 10.1007/s10208-015-9296-2

work page doi:10.1007/s10208-015-9296-2 2017

[16] [16]

Distributed zero-order algorithms for nonconvex multiagent optimization,

Y . Tang, J. Zhang, and N. Li, “Distributed zero-order algorithms for nonconvex multiagent optimization,”IEEE Transactions on Con- trol of Network Systems, vol. 8, no. 1, pp. 269–281, 2020. doi: 10.1109/tcns.2020.3024321

work page doi:10.1109/tcns.2020.3024321 2020

[17] [17]

ZONE: Zeroth-order non- convex multiagent optimization over networks,

D. Hajinezhad, M. Hong, and A. Garcia, “ZONE: Zeroth-order non- convex multiagent optimization over networks,”IEEE Transactions on Automatic Control, vol. 64, no. 10, pp. 3995–4010, 2019. doi: 10.1109/TAC.2019.2896025

work page doi:10.1109/tac.2019.2896025 2019

[18] [18]

Single point-based distributed zeroth- order optimization with a non-convex stochastic objective function,

E. Mhanna and M. Assaad, “Single point-based distributed zeroth- order optimization with a non-convex stochastic objective function,” in International Conference on Machine Learning. PMLR, 2023, pp. 24 701–24 719

work page 2023

[19] [19]

Zero-gradient-sum algorithms for distributed convex optimization: The continuous-time case,

J. Lu and C. Y . Tang, “Zero-gradient-sum algorithms for distributed convex optimization: The continuous-time case,”IEEE Transactions on Automatic Control, vol. 57, no. 9, pp. 2348–2354, 2012. doi: 10.1109/acc.2011.5991466

work page doi:10.1109/acc.2011.5991466 2012

[20] [20]

Randomized gradient-free distributed optimization methods for a multiagent system with unknown cost function,

Y . Pang and G. Hu, “Randomized gradient-free distributed optimization methods for a multiagent system with unknown cost function,”IEEE Transactions on Automatic Control, vol. 65, no. 1, pp. 333–340, 2019. doi: 10.1109/tac.2019.2914025

work page doi:10.1109/tac.2019.2914025 2019

[21] [21]

Randomized gradient-free method for multi- agent optimization over time-varying networks,

D. Yuan and D. W. Ho, “Randomized gradient-free method for multi- agent optimization over time-varying networks,”IEEE transactions on neural networks and learning systems, vol. 26, no. 6, pp. 1342–1347,

work page

[22] [22]

doi: 10.1109/tnnls.2014.2336806

work page doi:10.1109/tnnls.2014.2336806 2014

[23] [23]

Decentralized zeroth-order constrained stochas- tic optimization algorithms: Frank–Wolfe and variants with applications to black-box adversarial attacks,

A. K. Sahu and S. Kar, “Decentralized zeroth-order constrained stochas- tic optimization algorithms: Frank–Wolfe and variants with applications to black-box adversarial attacks,”Proceedings of the IEEE, vol. 108, no. 11, pp. 1890–1905, 2020. doi: 10.1109/jproc.2020.3012609

work page doi:10.1109/jproc.2020.3012609 1905

[24] [24]

Random gradient-free optimization for multiagent systems with communication noises under a time-varying weight balanced digraph,

D. Wang, J. Zhou, Z. Wang, and W. Wang, “Random gradient-free optimization for multiagent systems with communication noises under a time-varying weight balanced digraph,”IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 1, pp. 281–289, 2017. doi: 10.1109/tsmc.2017.2757265

work page doi:10.1109/tsmc.2017.2757265 2017

[25] [25]

Fast optimization with zeroth-order feedback in distributed, multi-user MIMO systems,

O. Bilenne, P. Mertikopoulos, and E. V . Belmega, “Fast optimization with zeroth-order feedback in distributed, multi-user MIMO systems,” IEEE Transactions on Signal Processing, vol. 68, pp. 6085–6100, 2020. doi: 10.1109/tsp.2020.3029983

work page doi:10.1109/tsp.2020.3029983 2020

[26] [26]

Ex- tremum seeking tracking for derivative-free distributed optimiza- tion,

N. Mimmo, G. Carnevale, A. Testa, and G. Notarstefano, “Ex- tremum seeking tracking for derivative-free distributed optimiza- tion,”IEEE Transactions on Control of Network Systems, 2024. doi: 10.1109/TCNS.2024.3510368 (Early access)

work page doi:10.1109/tcns.2024.3510368 2024

[27] [27]

Zeroth-order learning in continuous games via residual pseudogradient estimates,

Y . Huang and J. Hu, “Zeroth-order learning in continuous games via residual pseudogradient estimates,”IEEE Transactions on Automatic Control, 2024. doi: 10.1109/tac.2024.3479874 (Early access)

work page doi:10.1109/tac.2024.3479874 2024

[28] [28]

Convergence in multiagent coordination, consensus, and flocking,

V . D. Blondel, J. M. Hendrickx, A. Olshevsky, and J. N. Tsitsiklis, “Convergence in multiagent coordination, consensus, and flocking,” inIEEE Conf. on Decision and Control, 2005, pp. 2996–3000. doi: 10.1109/cdc.2005.1582620

work page doi:10.1109/cdc.2005.1582620 2005

[29] [29]

On the linear quadratic data-driven control , year =

R. Olfati-Saber and R. M. Murray, “Consensus problems in networks of agents with switching topology and time-delays,”IEEE Transactions on Automatic Control, vol. 49, no. 9, pp. 1520–1533, 2004. doi: 10.23919/ecc.2007.7068297

work page doi:10.23919/ecc.2007.7068297 2004

[30] [30]

A survey of consensus problems in multi-agent coordination,

W. Ren, R. W. Beard, and E. M. Atkins, “A survey of consensus problems in multi-agent coordination,” inAmerican Control Conference, Portland, OR, Jun. 2005, pp. 1859–1864. doi: 10.1109/acc.2005.1470239

work page doi:10.1109/acc.2005.1470239 2005

[31] [31]

Distributed strategies for generating weight-balanced and doubly stochastic digraphs,

B. Gharesifard and J. Cort ´es, “Distributed strategies for generating weight-balanced and doubly stochastic digraphs,”European Journal of Control, vol. 18, no. 6, pp. 539–557, 2012. doi: 10.3166/ejc.18.539-557

work page doi:10.3166/ejc.18.539-557 2012

[32] [32]

R. A. Horn and C. R. Johnson,Matrix Analysis. Cambridge University Press, 1985. ISBN 0521386322 14

work page 1985