Projected Stochastic Gradient Descent with Decision Dependent Distributions: Extended Version
Pith reviewed 2026-06-26 19:45 UTC · model grok-4.3
The pith
Projected primal-dual algorithm with surrogate sets bounds mean-square tracking error for decision-dependent stochastic optimization.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes an upper bound on the mean-square tracking error for a projected primal-dual algorithm that uses surrogate dual constraint sets in place of the true ones, for constrained stochastic optimization problems with decision-dependent distributions. The bound decomposes into interpretable terms reflecting the stochasticity of the problem, output measurement errors, time-variability of the problem, and the mismatch between surrogate and true dual constraint sets.
What carries the argument
Projected primal-dual algorithm that substitutes surrogate dual constraint sets for the true sets, with the resulting mean-square tracking-error bound decomposed into four terms.
If this is right
- Tracking error stays bounded despite the distribution shifting with each decision.
- Each of the four sources contributes additively and separately to the total error.
- The method requires only output measurements, not a full dynamic model.
- The mismatch term quantifies the price paid for using computationally simpler surrogate sets.
Where Pith is reading between the lines
- Choosing surrogates close to the true sets would drive the mismatch term toward zero and recover tighter bounds.
- The four-term split could guide algorithm tuning by revealing which error source dominates in practice.
- The same surrogate-substitution idea might extend to other feedback-based methods for time-varying problems.
- Numerical validation on power grids leaves open whether the bound remains useful in higher-dimensional or more rapidly varying settings.
Load-bearing premise
Surrogate dual constraint sets can be substituted for the true sets while preserving the stated tracking-error bound, with no quantitative closeness condition supplied.
What would settle it
Run the algorithm on a problem where the surrogate sets differ markedly from the true sets and check whether the observed mean-square tracking error remains below the sum of the four explicit terms.
Figures
read the original abstract
Online feedback optimization (OFO) leverages real-time output measurements to optimize the operation of networked systems without requiring full knowledge of system dynamics or disturbances. We develop an OFO approach for constrained stochastic optimization problems in which the distribution of the system's random parameters shifts in response to the control actions. We propose a projected primal-dual algorithm where the true dual constraint sets are replaced by surrogate sets. Our main result is an upper bound on the mean-square tracking error, which decomposes into four interpretable terms reflecting: (i) the stochasticity of the problem, (ii) output measurement errors, (iii) time-variability of the problem, and (iv) the mismatch between surrogate and true dual constraint sets. The theory is illustrated in a numerical experiment for power grids with price-responsive assets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops a projected primal-dual algorithm for online feedback optimization of constrained stochastic problems in which the distribution of random parameters depends on the control actions. True dual constraint sets are replaced by surrogate sets. The central result is an upper bound on mean-square tracking error that decomposes into four terms reflecting stochasticity of the problem, output measurement errors, time-variability, and mismatch between surrogate and true dual sets. The theory is illustrated via a numerical experiment on power grids with price-responsive assets.
Significance. If the bound derivation holds, the work supplies an interpretable performance guarantee for OFO under decision-dependent distributions, with the explicit mismatch term addressing a practical modeling choice. The decomposition into four distinct sources of error is a clear strength for analysis and design. The power-grid numerical example provides concrete illustration. No machine-checked proofs or parameter-free derivations are claimed.
minor comments (2)
- [Abstract] Abstract: the existence of the bound is asserted without stating its explicit form, key assumptions, or a one-line proof sketch; adding these would improve readability without lengthening the abstract substantially.
- The title refers to 'Projected Stochastic Gradient Descent' while the abstract and contribution describe a 'projected primal-dual algorithm'; aligning the title with the algorithm class used would reduce potential confusion.
Simulated Author's Rebuttal
We thank the referee for the positive summary and significance assessment of our work, as well as the recommendation for minor revision. No specific major comments were raised in the report.
Circularity Check
No significant circularity identified
full rationale
The paper presents a derived upper bound on mean-square tracking error for a projected primal-dual algorithm in online feedback optimization, with the bound explicitly decomposed into four additive terms that include the surrogate-true mismatch as an independent contribution. No equations or fitted quantities are defined in terms of the target tracking error itself, and the abstract states the result as an upper bound obtained from the algorithm rather than by construction or renaming. The derivation chain is therefore self-contained against external benchmarks, with no load-bearing self-citation, self-definitional steps, or fitted-input-called-prediction patterns exhibited.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Feedback design for multi-agent systems: A saddle point approach,
F. D. Brunner, H.-B. D¨ urr, and C. Ebenbauer, “Feedback design for multi-agent systems: A saddle point approach,”IEEE Conference on Decision and Control, 2012
2012
-
[2]
Online optimization as a feedback controller: Stability and tracking,
M. Colombino, E. Dall’Anese, and A. Bernstein, “Online optimization as a feedback controller: Stability and tracking,”IEEE Transactions on Control of Network Systems, 2020
2020
-
[3]
Online primal-dual methods with measurement feedback for time-varying convex optimization,
A. Bernstein, E. Dall’Anese, and A. Simonetto, “Online primal-dual methods with measurement feedback for time-varying convex optimization,”IEEE Transactions on Signal Processing, 2019
2019
-
[4]
Optimization algorithms as robust feedback controllers,
A. Hauswirth, Z. He, S. Bolognani, G. Hug, and F. D¨ orfler, “Optimization algorithms as robust feedback controllers,”Annual Reviews in Control, 2024
2024
-
[5]
Stochastic optimization with decision-dependent distributions,
D. Drusvyatskiy and L. Xiao, “Stochastic optimization with decision-dependent distributions,”Mathe- matics of Operations Research, 2023
2023
-
[6]
Performative prediction,
J. Perdomo, T. Zrnic, C. Mendler-D¨ unner, and M. Hardt, “Performative prediction,”International Conference on Machine Learning, 2020
2020
-
[7]
Stochastic saddle point problems with decision-dependent distributions,
K. Wood and E. Dall’Anese, “Stochastic saddle point problems with decision-dependent distributions,” SIAM Journal on Optimization, 2023
2023
-
[8]
Time-varying feedback optimization for quadratic programs with heterogeneous gradient step sizes,
A. Bernstein, J. Comden, Y. Chen, and J. Wang, “Time-varying feedback optimization for quadratic programs with heterogeneous gradient step sizes,”IEEE Conference on Decision and Control, 2023
2023
-
[9]
Data-driven online convex optimization for control of dynamical systems,
M. Nonhoff and M. A. M¨ uller, “Data-driven online convex optimization for control of dynamical systems,” IEEE Conference on Decision and Control, 2021
2021
-
[10]
Stochastic online feedback optimization for networks of non-compliant agents,
C. Kalil Lauand and A. Bernstein, “Stochastic online feedback optimization for networks of non-compliant agents,”IEEE Conference on Decision and Control (pre-publication version arXiv:2508.21414), 2025
-
[11]
Stochastic optimization for performative prediction,
C. Mendler-D¨ unner, J. Perdomo, T. Zrnic, and M. Hardt, “Stochastic optimization for performative prediction,”Advances in Neural Information Processing Systems, 2020
2020
-
[12]
Decision-dependent stochastic optimization: The role of distribution dynamics,
Z. He, S. Bolognani, F. D¨ orfler, and M. Muehlebach, “Decision-dependent stochastic optimization: The role of distribution dynamics,”arXiv preprint arXiv:2503.07324, 2025
-
[13]
Outside the echo chamber: Optimizing the performative risk,
J. P. Miller, J. C. Perdomo, and T. Zrnic, “Outside the echo chamber: Optimizing the performative risk,”International Conference on Machine Learning, 2021
2021
-
[14]
Multiuser optimization: Distributed algorithms and error analysis,
J. Koshal, A. Nedi´ c, and U. V. Shanbhag, “Multiuser optimization: Distributed algorithms and error analysis,”SIAM Journal on Optimization, 2011
2011
-
[15]
Online convex optimization with stochastic constraints,
H. Yu, M. Neely, and X. Wei, “Online convex optimization with stochastic constraints,”Advances in Neural Information Processing Systems, 2017. 11
2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.