pith. sign in

arxiv: 2404.04355 · v2 · pith:V32U3VNYnew · submitted 2024-04-05 · 🧮 math.OC · cs.SY· eess.SY

Gray-Box Nonlinear Feedback Optimization

classification 🧮 math.OC cs.SYeess.SY
keywords approachesoptimizationclosed-loopcontrollerfeedbackgray-boxmodel-freesensitivity
0
0 comments X
read the original abstract

Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the input-output sensitivity matrix of the system to construct gradients, whereas model-free approaches eliminate this need by estimating gradients from real-time objective evaluations. These approaches offer complementary benefits in sample efficiency and accuracy against model mismatch, i.e., sensitivity errors. To achieve balanced closed-loop performance, we propose a gray-box feedback optimization controller, featuring systematic incorporation of approximate sensitivities into model-free updates via a tunable convex combination. We provide unified performance characterizations covering different approaches. We elucidate how cumulative sensitivity errors (model-based) and variances due to stochastic exploration (model-free) shape the closed-loop behavior and induce a trade-off between iteration and dimensional dependence. The proposed controller retains sample efficiency and provable (local) optimality for nonconvex problems despite inaccurate sensitivities. We further develop and characterize a running gray-box controller that handles constrained time-varying problems with changing objectives and steady-state input-output maps.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Policy Optimization for Unknown Systems using Differentiable Model Predictive Control

    eess.SY 2025-11 unverdicted novelty 6.0

    Hybrid differentiable-plus-zeroth-order policy optimization for MPC yields faster transients than pure data-driven methods while retaining convergence guarantees under model mismatch, demonstrated on a 12D quadcopter.