Gray-Box Nonlinear Feedback Optimization

Florian D\"orfler; Michael Muehlebach; Saverio Bolognani; Zhiyu He

arxiv: 2404.04355 · v2 · pith:V32U3VNYnew · submitted 2024-04-05 · 🧮 math.OC · cs.SY· eess.SY

Gray-Box Nonlinear Feedback Optimization

Zhiyu He , Saverio Bolognani , Michael Muehlebach , Florian D\"orfler This is my paper

classification 🧮 math.OC cs.SYeess.SY

keywords approachesoptimizationclosed-loopcontrollerfeedbackgray-boxmodel-freesensitivity

0 comments

read the original abstract

Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the input-output sensitivity matrix of the system to construct gradients, whereas model-free approaches eliminate this need by estimating gradients from real-time objective evaluations. These approaches offer complementary benefits in sample efficiency and accuracy against model mismatch, i.e., sensitivity errors. To achieve balanced closed-loop performance, we propose a gray-box feedback optimization controller, featuring systematic incorporation of approximate sensitivities into model-free updates via a tunable convex combination. We provide unified performance characterizations covering different approaches. We elucidate how cumulative sensitivity errors (model-based) and variances due to stochastic exploration (model-free) shape the closed-loop behavior and induce a trade-off between iteration and dimensional dependence. The proposed controller retains sample efficiency and provable (local) optimality for nonconvex problems despite inaccurate sensitivities. We further develop and characterize a running gray-box controller that handles constrained time-varying problems with changing objectives and steady-state input-output maps.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Policy Optimization for Unknown Systems using Differentiable Model Predictive Control
eess.SY 2025-11 unverdicted novelty 6.0

Hybrid differentiable-plus-zeroth-order policy optimization for MPC yields faster transients than pure data-driven methods while retaining convergence guarantees under model mismatch, demonstrated on a 12D quadcopter.