Distributionally Robust Probabilistic Prediction for Stochastic Dynamical Systems

Jianping He; Tao Xu

arxiv: 2412.08858 · v2 · submitted 2024-12-12 · 🧮 math.OC

Distributionally Robust Probabilistic Prediction for Stochastic Dynamical Systems

Tao Xu , Jianping He This is my paper

Pith reviewed 2026-05-23 07:40 UTC · model grok-4.3

classification 🧮 math.OC

keywords distributionally robust optimizationprobabilistic predictionstochastic dynamical systemsmaximin problemambiguity setworst-case performanceEuclidean reformulation

0 comments

The pith

Probabilistic predictors for stochastic dynamical systems can achieve worst-case performance guarantees over an ambiguity set by transforming the maximin problem to Euclidean space.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a distributionally robust probabilistic prediction framework for stochastic dynamical systems that provides predictors with guaranteed performance in the worst case over a set of possible systems. This is achieved by reformulating the original functional maximin optimization problem, which is intractable over probability measures, into an equivalent problem in Euclidean space. From this reformulation, two suboptimal predictors are derived by relaxing constraints: Noise-DRPP by relaxing the ambiguity set and Eig-DRPP by relaxing the predictor. Optimality gaps to the global optimum are also derived, and numerical simulations compare their performance.

Core claim

One can design probabilistic predictors that have worst-case performance guarantees over a pre-defined ambiguity set of stochastic dynamical systems, achieved by equivalently transforming the original maximin from function spaces to Euclidean spaces, leading to two suboptimal solutions with derived optimality gaps.

What carries the argument

The equivalent transformation of the functional maximin problem over probability measures to an optimization problem in Euclidean space.

If this is right

One can obtain predictors with worst-case guarantees for SDSs.
Two tractable suboptimal predictors, Noise-DRPP and Eig-DRPP, can be computed.
Optimality gaps between the suboptimal predictors and the global optimum can be derived.
Numerical simulations can compare performance under different SDSs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This reformulation approach might apply to other optimization problems involving probability measures in dynamical systems.
Implementing these predictors could improve robustness in control systems with model uncertainty.
Further relaxation techniques could lead to even more efficient predictors.

Load-bearing premise

The functional maximin problem over probability measures with densities with respect to the Lebesgue measure admits an equivalent reformulation as an optimization problem in Euclidean space whose solutions remain meaningful predictors for the original SDSs.

What would settle it

Finding an instance of an SDS where the Euclidean space solution does not provide a valid probabilistic predictor or fails to guarantee the worst-case performance over the ambiguity set.

Figures

Figures reproduced from arXiv: 2412.08858 by Jianping He, Tao Xu.

**Figure 2.** Figure 2: Prediction performance of different probabilistic predictors on different SDSs under different control strategies. [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗

**Figure 3.** Figure 3: Predictive 90% confidence regions of probabilistic predictors for different SDSs under different control strategies. [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗

read the original abstract

Probabilistic prediction of stochastic dynamical systems (SDSs) aims to accurately predict the conditional probability distributions of future states. However, accurate probabilistic predictions tightly hinge on accurate distributional information from a nominal model, which is hardly available in practice. To address this issue, we propose a novel functional-maximin-based distributionally robust probabilistic prediction (DRPP) framework. In this framework, one can design probabilistic predictors that have worst-case performance guarantees over a pre-defined ambiguity set of SDSs. Nevertheless, DRPP requires optimizing over the space of probability measures with density functions with respect to the Lebesgue measure, which is generally intractable. We develop a methodology that equivalently transforms the original maximin from function spaces to Euclidean spaces. Although it remains intractable to seek a global optimal solution, two suboptimal solutions are derived. By relaxing the constraints on the ambiguity set, we obtain a suboptimal predictor called Noise-DRPP. Relaxing the constraints on the predictor yields another suboptimal predictor, Eig-DRPP. Moreover, optimality gaps between the proposed predictors and the global optimal predictor are derived. Finally, we conduct elaborate numerical simulations to compare the performance of different predictors under different SDSs.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a concrete way to move a functional maximin for robust probabilistic prediction into Euclidean space and supplies two named suboptimal predictors with gap bounds.

read the letter

The main thing here is a distributionally robust probabilistic prediction setup for stochastic dynamical systems that starts from a maximin over an ambiguity set and then converts the problem from function space to Euclidean space. From that conversion it produces two practical approximations: Noise-DRPP by loosening the ambiguity set and Eig-DRPP by loosening the predictor itself, plus explicit bounds on the optimality gaps for each. That combination of a reformulation plus named methods with gap derivations is the actual new material. It directly tackles the intractability that comes from optimizing over densities, which is a recurring issue in this corner of robust optimization. The numerical simulations mentioned at the end are also a plus for seeing how the predictors behave on concrete systems. The central claim that the Euclidean version preserves the original worst-case guarantees is the part that still needs verification. The abstract states the equivalence but does not include the steps or any small-scale check, so it is not yet clear how much structure is retained or whether the solutions remain meaningful predictors once mapped back. The gap derivations depend on the specific relaxations chosen, and without seeing those details it is hard to judge how tight or useful the bounds end up being. This is written for people already working on distributionally robust methods for dynamical systems or stochastic control. A reader who needs worst-case guarantees when the nominal model is unreliable could extract usable predictors and gap expressions from it. The paper engages the literature on ambiguity sets and maximin problems without obvious circularity. I would send it to peer review so the transformation and the simulations can be checked in full.

Referee Report

0 major / 3 minor

Summary. The paper proposes a distributionally robust probabilistic prediction (DRPP) framework for stochastic dynamical systems (SDSs) via a functional maximin formulation over a pre-defined ambiguity set of SDSs. It claims to develop an equivalent transformation of this maximin from the space of probability measures (with densities w.r.t. Lebesgue measure) to a Euclidean-space optimization problem, derives two suboptimal predictors (Noise-DRPP via relaxation of ambiguity-set constraints, and Eig-DRPP via relaxation of predictor constraints) together with their optimality gaps relative to the global optimum, and presents numerical simulations comparing performance under different SDSs.

Significance. If the claimed equivalence transformation is rigorously established and the derived optimality gaps correctly bound the suboptimality while preserving worst-case guarantees, the framework would provide a tractable route to robust probabilistic predictors for SDSs under distributional uncertainty. The explicit construction of suboptimal solutions with gap analysis and the numerical validation are positive features that could support applications in robust control and forecasting.

minor comments (3)

[Abstract / Introduction] The abstract states that the functional maximin is equivalently transformed to Euclidean space and that optimality gaps are derived, but the provided text supplies no equations or proof outline for the transformation. Adding a high-level sketch of the key steps (e.g., how the density constraint and maximin are mapped) in the introduction or a dedicated methodology subsection would improve readability without altering the technical content.
[Methodology (presumed section deriving Noise-DRPP and Eig-DRPP)] The two suboptimal predictors are obtained by relaxing different parts of the problem (ambiguity set vs. predictor). Clarifying in the text whether these relaxations are independent or could be combined, and whether the resulting Euclidean problems remain convex or admit efficient solvers, would help readers assess computational practicality.
[Numerical experiments] Numerical simulations are described as 'elaborate' and compare predictors under different SDSs, but the abstract does not specify the state dimensions, ambiguity-set parameterizations, or quantitative metrics (e.g., Wasserstein distance or prediction error). Including a brief table of simulation parameters and performance statistics in the main text would strengthen the empirical support.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our work and the recommendation of minor revision. The referee's summary accurately captures the proposed DRPP framework, the functional-maximin formulation, the transformation to Euclidean-space optimization, the derivation of Noise-DRPP and Eig-DRPP predictors with optimality-gap bounds, and the numerical experiments.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper constructs a DRPP framework from first principles by defining an ambiguity set of SDSs and transforming the functional maximin over densities w.r.t. Lebesgue measure into an equivalent Euclidean-space problem. Suboptimal predictors (Noise-DRPP, Eig-DRPP) are obtained explicitly by relaxing constraints on the ambiguity set or the predictor, with optimality gaps derived from those relaxations. No step reduces a claimed prediction or result to a fitted parameter, self-citation chain, or input by construction; the supplied abstract and description contain no equations that exhibit self-definitional equivalence or renaming of known results. The derivation is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the existence and well-definedness of a pre-defined ambiguity set of SDSs together with the validity of the functional-to-Euclidean transformation; no free parameters, invented entities, or non-standard axioms are explicitly introduced in the abstract.

axioms (2)

standard math Probability measures admit densities with respect to Lebesgue measure on the state space.
Invoked when the abstract states that DRPP requires optimizing over the space of probability measures with density functions w.r.t. the Lebesgue measure.
domain assumption The maximin problem over function spaces admits an equivalent finite-dimensional reformulation.
This is the key methodological step asserted without proof details in the abstract.

pith-pipeline@v0.9.0 · 5725 in / 1477 out tokens · 22522 ms · 2026-05-23T07:40:36.388073+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

45 extracted references · 45 canonical work pages

[1]

A novel probabilistic forecast system predicting anomalously warm 2018-2022 reinforcing the long-term global warming trend,

F. Sévellec and S. S. Drijfhout, “A novel probabilistic forecast system predicting anomalously warm 2018-2022 reinforcing the long-term global warming trend,” Nature Communications, vol. 9, no. 1, p. 3024, Aug. 2018

work page 2018
[2]

Evaluation of individual and ensemble probabilistic forecasts of covid-19 mortality in the united states,

E. Y . Cramer, E. L. Ray, V . K. Lopez, J. Bracher, A. Brennen, A. J. Castro Rivadeneira, A. Gerding, T. Gneiting, K. H. House, Y . Huang et al., “Evaluation of individual and ensemble probabilistic forecasts of covid-19 mortality in the united states,” Proceedings of the National Academy of Sciences , vol. 119, no. 15, 2022

work page 2022
[3]

The value of probabilistic prediction,

R. Buizza, “The value of probabilistic prediction,” Atmospheric Science Letters, vol. 9, no. 2, pp. 36–42, 2008

work page 2008
[4]

Probabilistically safe robot planning with confidence- based human predictions,

J. Fisac, A. Bajcsy, S. Herbert, D. Fridovich-Keil, S. Wang, C. Tomlin, and A. Dragan, “Probabilistically safe robot planning with confidence- based human predictions,” Robotics: Science and Systems XIV , 2018

work page 2018
[5]

Probabilistic forecasting,

T. Gneiting and M. Katzfuss, “Probabilistic forecasting,” Annual Review of Statistics and Its Application , vol. 1, no. 1, pp. 125–151, 2014. 13

work page 2014
[6]

Strictly proper scoring rules, prediction, and estimation,

T. Gneiting and A. E. Raftery, “Strictly proper scoring rules, prediction, and estimation,” Journal of the American Statistical Association , vol. 102, no. 477, pp. 359–378, Mar. 2007

work page 2007
[7]

Probabilistic prediction methods for nonlinear systems with application to stochastic model predictive control,

D. Landgraf, A. Völz, F. Berkel, K. Schmidt, T. Specker, and K. Graichen, “Probabilistic prediction methods for nonlinear systems with application to stochastic model predictive control,” Annual Reviews in Control, vol. 56, Jan. 2023

work page 2023
[8]

Uncertain systems,

D. Hinrichsen and A. J. Pritchard, “Uncertain systems,” in Mathemat- ical Systems Theory I: Modelling, State Space Analysis, Stability and Robustness, D. Hinrichsen and A. J. Pritchard, Eds. Berlin, Heidelberg: Springer, 2005, pp. 517–713

work page 2005
[9]

Schmüdgen, The Moment Problem, ser

K. Schmüdgen, The Moment Problem, ser. Graduate Texts in Mathemat- ics. Cham: Springer International Publishing, 2017, vol. 277

work page 2017
[10]

Distributionally robust optimization under mo- ment uncertainty with application to data-driven problems,

E. Delage and Y . Ye, “Distributionally robust optimization under mo- ment uncertainty with application to data-driven problems,” Operations Research, Jan. 2010

work page 2010
[11]

P. S. Maybeck, Stochastic Models, Estimation, and Control . Academic press, 1982

work page 1982
[12]

An efficient implementation of the second order extended kalman filter,

M. Roth and F. Gustafsson, “An efficient implementation of the second order extended kalman filter,” in 14th International Conference on Information Fusion, Jul. 2011, pp. 1–6

work page 2011
[13]

New developments in state estimation for nonlinear systems,

M. Nørgaard, N. K. Poulsen, and O. Ravn, “New developments in state estimation for nonlinear systems,”Automatica, vol. 36, no. 11, pp. 1627– 1638, Nov. 2000

work page 2000
[14]

A systematization of the unscented kalman filter theory,

H. M. T. Menegaz, J. Y . Ishihara, G. A. Borges, and A. N. Vargas, “A systematization of the unscented kalman filter theory,” IEEE Transac- tions on Automatic Control , vol. 60, no. 10, pp. 2583–2598, Oct. 2015

work page 2015
[15]

Derivative-free estimation methods: New results and performance analysis,

M. Šimandl and J. Duník, “Derivative-free estimation methods: New results and performance analysis,” Automatica, vol. 45, no. 7, pp. 1749– 1757, Jul. 2009

work page 2009
[16]

Recursive bayesian estimation using gaussian sums,

H. W. Sorenson and D. L. Alspach, “Recursive bayesian estimation using gaussian sums,” Automatica, vol. 7, no. 4, pp. 465–479, Jul. 1971

work page 1971
[17]

Nonlinear bayesian estimation using gaus- sian sum approximations,

D. Alspach and H. Sorenson, “Nonlinear bayesian estimation using gaus- sian sum approximations,” IEEE Transactions on Automatic Control , vol. 17, no. 4, pp. 439–448, Aug. 1972

work page 1972
[18]

Mixture kalman filters,

R. Chen and J. S. Liu, “Mixture kalman filters,” Journal of the Royal Statistical Society Series B: Statistical Methodology , vol. 62, no. 3, pp. 493–508, Sep. 2000

work page 2000
[19]

The homogeneous chaos,

N. Wiener, “The homogeneous chaos,” American Journal of Mathemat- ics, vol. 60, no. 4, pp. 897–936, 1938

work page 1938
[20]

An efficient method for stochastic optimal control with joint chance constraints for nonlinear systems,

J. A. Paulson and A. Mesbah, “An efficient method for stochastic optimal control with joint chance constraints for nonlinear systems,” International Journal of Robust and Nonlinear Control , vol. 29, no. 15, pp. 5017–5037, 2019

work page 2019
[21]

Särkkä and L

S. Särkkä and L. Svensson, Bayesian Filtering and Smoothing , 2nd ed., ser. Institute of Mathematical Statistics Textbooks. Cambridge: Cam- bridge University Press, 2023

work page 2023
[22]

Modern monte carlo methods for efficient uncertainty quan- tification and propagation: A survey,

J. Zhang, “Modern monte carlo methods for efficient uncertainty quan- tification and propagation: A survey,” WIREs Computational Statistics , vol. 13, no. 5, p. e1539, 2021

work page 2021
[23]

Stochastic linear model predictive control with chance constraints – a review,

M. Farina, L. Giulioni, and R. Scattolini, “Stochastic linear model predictive control with chance constraints – a review,”Journal of Process Control, vol. 44, pp. 53–67, Aug. 2016

work page 2016
[24]

Stochastic model predictive control: An overview and perspectives for future research,

A. Mesbah, “Stochastic model predictive control: An overview and perspectives for future research,” IEEE Control Systems Magazine , vol. 36, no. 6, pp. 30–44, Dec. 2016

work page 2016
[25]

Nonlinear stochastic model predictive control: Existence, measurability, and stochastic asymptotic stability,

R. D. McAllister and J. B. Rawlings, “Nonlinear stochastic model predictive control: Existence, measurability, and stochastic asymptotic stability,” IEEE Transactions on Automatic Control , vol. 68, no. 3, pp. 1524–1536, Mar. 2023

work page 2023
[26]

Distributionally robust chance constrained data-enabled predictive control,

J. Coulson, J. Lygeros, and F. Dörfler, “Distributionally robust chance constrained data-enabled predictive control,” IEEE Transactions on Automatic Control, vol. 67, no. 7, pp. 3289–3304, Jul. 2022

work page 2022
[27]

C. P. Robert, The Bayesian Choice , ser. Springer Texts in Statistics. New York, NY: Springer, 2007

work page 2007
[28]

Approximate bayesian forecasting,

D. T. Frazier, W. Maneesoonthorn, G. M. Martin, and B. P. M. McCabe, “Approximate bayesian forecasting,” International Journal of Forecast- ing, vol. 35, no. 2, pp. 521–539, Apr. 2019

work page 2019
[29]

Quantile regression: 40 years on,

R. Koenker, “Quantile regression: 40 years on,” Annual Review of Economics, vol. 9, no. V olume 9, 2017, pp. 155–176, Aug. 2017

work page 2017
[30]

Distributional regression forests for probabilistic precipitation forecasting in complex terrain,

L. Schlosser, T. Hothorn, R. Stauffer, and A. Zeileis, “Distributional regression forests for probabilistic precipitation forecasting in complex terrain,” The Annals of Applied Statistics , vol. 13, no. 3, pp. 1564–1589, Sep. 2019

work page 2019
[31]

NGBoost: Natural gradient boosting for probabilistic prediction,

T. Duan, A. Anand, D. Y . Ding, K. K. Thai, S. Basu, A. Ng, and A. Schuler, “NGBoost: Natural gradient boosting for probabilistic prediction,” in Proceedings of the 37th International Conference on Machine Learning. PMLR, Nov. 2020, pp. 2690–2700

work page 2020
[32]

DeepAR: Probabilistic forecasting with autoregressive recurrent networks,

D. Salinas, V . Flunkert, J. Gasthaus, and T. Januschowski, “DeepAR: Probabilistic forecasting with autoregressive recurrent networks,” Inter- national Journal of Forecasting , vol. 36, no. 3, pp. 1181–1191, Jul. 2020

work page 2020
[33]

A review of predictive uncertainty estimation with machine learning,

H. Tyralis and G. Papacharalampous, “A review of predictive uncertainty estimation with machine learning,”Artificial Intelligence Review, vol. 57, no. 4, p. 94, Mar. 2024

work page 2024
[34]

H. E. Scarf, K. J. Arrow, and S. Karlin, A Min-Max Solution of an Inventory Problem. Rand Corporation Santa Monica, 1957

work page 1957
[35]

The distribution free newsboy problem: Review and extensions,

G. Gallego and I. Moon, “The distribution free newsboy problem: Review and extensions,” Journal of the Operational Research Society , vol. 44, no. 8, pp. 825–834, Aug. 1993

work page 1993
[36]

Proper local scoring rules,

M. Parry, A. P. Dawid, and S. Lauritzen, “Proper local scoring rules,” The Annals of Statistics , vol. 40, no. 1, Feb. 2012

work page 2012
[37]

Distributionally robust optimization: A review on theory and applications,

F. Lin, X. Fang, and Z. Gao, “Distributionally robust optimization: A review on theory and applications,” Numerical Algebra, Control and Optimization, vol. 12, no. 1, pp. 159–212, 2022

work page 2022
[38]

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction . MIT press, 2018

work page 2018
[39]

Dynamic programming,

R. Bellman, “Dynamic programming,” Science, vol. 153, no. 3731, pp. 34–37, Jul. 1966

work page 1966
[40]

Checking local optimality in con- strained quadratic programming is np-hard,

P. M. Pardalos and G. Schnitger, “Checking local optimality in con- strained quadratic programming is np-hard,” Operations Research Let- ters, vol. 7, no. 1, pp. 33–35, Feb. 1988

work page 1988
[41]

What is local optimality in nonconvex-nonconcave minimax optimization?

C. Jin, P. Netrapalli, and M. Jordan, “What is local optimality in nonconvex-nonconcave minimax optimization?” in Proceedings of the 37th International Conference on Machine Learning . PMLR, Nov. 2020, pp. 4880–4889

work page 2020
[42]

The complexity of constrained min-max optimization,

C. Daskalakis, S. Skoulakis, and M. Zampetakis, “The complexity of constrained min-max optimization,” in Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing , ser. STOC 2021. New York, NY , USA: Association for Computing Machinery, Jun. 2021, pp. 1466–1478

work page 2021
[43]

Limiting behaviors of nonconvex-nonconcave minimax optimization via continuous-time systems,

B. Grimmer, H. Lu, P. Worah, and V . Mirrokni, “Limiting behaviors of nonconvex-nonconcave minimax optimization via continuous-time systems,” in Proceedings of The 33rd International Conference on Algorithmic Learning Theory . PMLR, Mar. 2022, pp. 465–487

work page 2022
[44]

Implicit learning dynamics in stack- elberg games: Equilibria characterization, convergence analysis, and empirical study,

T. Fiez, B. Chasnov, and L. Ratliff, “Implicit learning dynamics in stack- elberg games: Equilibria characterization, convergence analysis, and empirical study,” in Proceedings of the 37th International Conference on Machine Learning . PMLR, Nov. 2020, pp. 3133–3144. APPENDIX A DISCUSSION ON CANONICALIZATION If we align pξk with pwk and optimize over pwk,...

work page 2020
[45]

= 1, Ew∼pwk [w] = µk γ3,k ¯Σk ⪯ Ew∼pwk h (w−¯µk) (w−¯µk)T i ⪯ γ2,k ¯Σk ¯Σk (µk − ¯µk) (µk − ¯µk)⊤ γ1,k ⪰ 0 pwk(w) ≥ 0 ∀w ∈ Rdx . (24) The objective is minimized when E h (wk −¯µk) (wk −¯µk)T i equals γ2,k ¯Σk, which means the solution for p∗ wk is the set n pwk | Ew∼pwk h (w−¯µk) (w−¯µk)T i = γ2,k ¯Σk o , and the objective value at (ˆp∗ k, Φ∗ k) is −1 2 d...

work page 2013

[1] [1]

A novel probabilistic forecast system predicting anomalously warm 2018-2022 reinforcing the long-term global warming trend,

F. Sévellec and S. S. Drijfhout, “A novel probabilistic forecast system predicting anomalously warm 2018-2022 reinforcing the long-term global warming trend,” Nature Communications, vol. 9, no. 1, p. 3024, Aug. 2018

work page 2018

[2] [2]

Evaluation of individual and ensemble probabilistic forecasts of covid-19 mortality in the united states,

E. Y . Cramer, E. L. Ray, V . K. Lopez, J. Bracher, A. Brennen, A. J. Castro Rivadeneira, A. Gerding, T. Gneiting, K. H. House, Y . Huang et al., “Evaluation of individual and ensemble probabilistic forecasts of covid-19 mortality in the united states,” Proceedings of the National Academy of Sciences , vol. 119, no. 15, 2022

work page 2022

[3] [3]

The value of probabilistic prediction,

R. Buizza, “The value of probabilistic prediction,” Atmospheric Science Letters, vol. 9, no. 2, pp. 36–42, 2008

work page 2008

[4] [4]

Probabilistically safe robot planning with confidence- based human predictions,

J. Fisac, A. Bajcsy, S. Herbert, D. Fridovich-Keil, S. Wang, C. Tomlin, and A. Dragan, “Probabilistically safe robot planning with confidence- based human predictions,” Robotics: Science and Systems XIV , 2018

work page 2018

[5] [5]

Probabilistic forecasting,

T. Gneiting and M. Katzfuss, “Probabilistic forecasting,” Annual Review of Statistics and Its Application , vol. 1, no. 1, pp. 125–151, 2014. 13

work page 2014

[6] [6]

Strictly proper scoring rules, prediction, and estimation,

T. Gneiting and A. E. Raftery, “Strictly proper scoring rules, prediction, and estimation,” Journal of the American Statistical Association , vol. 102, no. 477, pp. 359–378, Mar. 2007

work page 2007

[7] [7]

Probabilistic prediction methods for nonlinear systems with application to stochastic model predictive control,

D. Landgraf, A. Völz, F. Berkel, K. Schmidt, T. Specker, and K. Graichen, “Probabilistic prediction methods for nonlinear systems with application to stochastic model predictive control,” Annual Reviews in Control, vol. 56, Jan. 2023

work page 2023

[8] [8]

Uncertain systems,

D. Hinrichsen and A. J. Pritchard, “Uncertain systems,” in Mathemat- ical Systems Theory I: Modelling, State Space Analysis, Stability and Robustness, D. Hinrichsen and A. J. Pritchard, Eds. Berlin, Heidelberg: Springer, 2005, pp. 517–713

work page 2005

[9] [9]

Schmüdgen, The Moment Problem, ser

K. Schmüdgen, The Moment Problem, ser. Graduate Texts in Mathemat- ics. Cham: Springer International Publishing, 2017, vol. 277

work page 2017

[10] [10]

Distributionally robust optimization under mo- ment uncertainty with application to data-driven problems,

E. Delage and Y . Ye, “Distributionally robust optimization under mo- ment uncertainty with application to data-driven problems,” Operations Research, Jan. 2010

work page 2010

[11] [11]

P. S. Maybeck, Stochastic Models, Estimation, and Control . Academic press, 1982

work page 1982

[12] [12]

An efficient implementation of the second order extended kalman filter,

M. Roth and F. Gustafsson, “An efficient implementation of the second order extended kalman filter,” in 14th International Conference on Information Fusion, Jul. 2011, pp. 1–6

work page 2011

[13] [13]

New developments in state estimation for nonlinear systems,

M. Nørgaard, N. K. Poulsen, and O. Ravn, “New developments in state estimation for nonlinear systems,”Automatica, vol. 36, no. 11, pp. 1627– 1638, Nov. 2000

work page 2000

[14] [14]

A systematization of the unscented kalman filter theory,

H. M. T. Menegaz, J. Y . Ishihara, G. A. Borges, and A. N. Vargas, “A systematization of the unscented kalman filter theory,” IEEE Transac- tions on Automatic Control , vol. 60, no. 10, pp. 2583–2598, Oct. 2015

work page 2015

[15] [15]

Derivative-free estimation methods: New results and performance analysis,

M. Šimandl and J. Duník, “Derivative-free estimation methods: New results and performance analysis,” Automatica, vol. 45, no. 7, pp. 1749– 1757, Jul. 2009

work page 2009

[16] [16]

Recursive bayesian estimation using gaussian sums,

H. W. Sorenson and D. L. Alspach, “Recursive bayesian estimation using gaussian sums,” Automatica, vol. 7, no. 4, pp. 465–479, Jul. 1971

work page 1971

[17] [17]

Nonlinear bayesian estimation using gaus- sian sum approximations,

D. Alspach and H. Sorenson, “Nonlinear bayesian estimation using gaus- sian sum approximations,” IEEE Transactions on Automatic Control , vol. 17, no. 4, pp. 439–448, Aug. 1972

work page 1972

[18] [18]

Mixture kalman filters,

R. Chen and J. S. Liu, “Mixture kalman filters,” Journal of the Royal Statistical Society Series B: Statistical Methodology , vol. 62, no. 3, pp. 493–508, Sep. 2000

work page 2000

[19] [19]

The homogeneous chaos,

N. Wiener, “The homogeneous chaos,” American Journal of Mathemat- ics, vol. 60, no. 4, pp. 897–936, 1938

work page 1938

[20] [20]

An efficient method for stochastic optimal control with joint chance constraints for nonlinear systems,

J. A. Paulson and A. Mesbah, “An efficient method for stochastic optimal control with joint chance constraints for nonlinear systems,” International Journal of Robust and Nonlinear Control , vol. 29, no. 15, pp. 5017–5037, 2019

work page 2019

[21] [21]

Särkkä and L

S. Särkkä and L. Svensson, Bayesian Filtering and Smoothing , 2nd ed., ser. Institute of Mathematical Statistics Textbooks. Cambridge: Cam- bridge University Press, 2023

work page 2023

[22] [22]

Modern monte carlo methods for efficient uncertainty quan- tification and propagation: A survey,

J. Zhang, “Modern monte carlo methods for efficient uncertainty quan- tification and propagation: A survey,” WIREs Computational Statistics , vol. 13, no. 5, p. e1539, 2021

work page 2021

[23] [23]

Stochastic linear model predictive control with chance constraints – a review,

M. Farina, L. Giulioni, and R. Scattolini, “Stochastic linear model predictive control with chance constraints – a review,”Journal of Process Control, vol. 44, pp. 53–67, Aug. 2016

work page 2016

[24] [24]

Stochastic model predictive control: An overview and perspectives for future research,

A. Mesbah, “Stochastic model predictive control: An overview and perspectives for future research,” IEEE Control Systems Magazine , vol. 36, no. 6, pp. 30–44, Dec. 2016

work page 2016

[25] [25]

Nonlinear stochastic model predictive control: Existence, measurability, and stochastic asymptotic stability,

R. D. McAllister and J. B. Rawlings, “Nonlinear stochastic model predictive control: Existence, measurability, and stochastic asymptotic stability,” IEEE Transactions on Automatic Control , vol. 68, no. 3, pp. 1524–1536, Mar. 2023

work page 2023

[26] [26]

Distributionally robust chance constrained data-enabled predictive control,

J. Coulson, J. Lygeros, and F. Dörfler, “Distributionally robust chance constrained data-enabled predictive control,” IEEE Transactions on Automatic Control, vol. 67, no. 7, pp. 3289–3304, Jul. 2022

work page 2022

[27] [27]

C. P. Robert, The Bayesian Choice , ser. Springer Texts in Statistics. New York, NY: Springer, 2007

work page 2007

[28] [28]

Approximate bayesian forecasting,

D. T. Frazier, W. Maneesoonthorn, G. M. Martin, and B. P. M. McCabe, “Approximate bayesian forecasting,” International Journal of Forecast- ing, vol. 35, no. 2, pp. 521–539, Apr. 2019

work page 2019

[29] [29]

Quantile regression: 40 years on,

R. Koenker, “Quantile regression: 40 years on,” Annual Review of Economics, vol. 9, no. V olume 9, 2017, pp. 155–176, Aug. 2017

work page 2017

[30] [30]

Distributional regression forests for probabilistic precipitation forecasting in complex terrain,

L. Schlosser, T. Hothorn, R. Stauffer, and A. Zeileis, “Distributional regression forests for probabilistic precipitation forecasting in complex terrain,” The Annals of Applied Statistics , vol. 13, no. 3, pp. 1564–1589, Sep. 2019

work page 2019

[31] [31]

NGBoost: Natural gradient boosting for probabilistic prediction,

T. Duan, A. Anand, D. Y . Ding, K. K. Thai, S. Basu, A. Ng, and A. Schuler, “NGBoost: Natural gradient boosting for probabilistic prediction,” in Proceedings of the 37th International Conference on Machine Learning. PMLR, Nov. 2020, pp. 2690–2700

work page 2020

[32] [32]

DeepAR: Probabilistic forecasting with autoregressive recurrent networks,

D. Salinas, V . Flunkert, J. Gasthaus, and T. Januschowski, “DeepAR: Probabilistic forecasting with autoregressive recurrent networks,” Inter- national Journal of Forecasting , vol. 36, no. 3, pp. 1181–1191, Jul. 2020

work page 2020

[33] [33]

A review of predictive uncertainty estimation with machine learning,

H. Tyralis and G. Papacharalampous, “A review of predictive uncertainty estimation with machine learning,”Artificial Intelligence Review, vol. 57, no. 4, p. 94, Mar. 2024

work page 2024

[34] [34]

H. E. Scarf, K. J. Arrow, and S. Karlin, A Min-Max Solution of an Inventory Problem. Rand Corporation Santa Monica, 1957

work page 1957

[35] [35]

The distribution free newsboy problem: Review and extensions,

G. Gallego and I. Moon, “The distribution free newsboy problem: Review and extensions,” Journal of the Operational Research Society , vol. 44, no. 8, pp. 825–834, Aug. 1993

work page 1993

[36] [36]

Proper local scoring rules,

M. Parry, A. P. Dawid, and S. Lauritzen, “Proper local scoring rules,” The Annals of Statistics , vol. 40, no. 1, Feb. 2012

work page 2012

[37] [37]

Distributionally robust optimization: A review on theory and applications,

F. Lin, X. Fang, and Z. Gao, “Distributionally robust optimization: A review on theory and applications,” Numerical Algebra, Control and Optimization, vol. 12, no. 1, pp. 159–212, 2022

work page 2022

[38] [38]

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction . MIT press, 2018

work page 2018

[39] [39]

Dynamic programming,

R. Bellman, “Dynamic programming,” Science, vol. 153, no. 3731, pp. 34–37, Jul. 1966

work page 1966

[40] [40]

Checking local optimality in con- strained quadratic programming is np-hard,

P. M. Pardalos and G. Schnitger, “Checking local optimality in con- strained quadratic programming is np-hard,” Operations Research Let- ters, vol. 7, no. 1, pp. 33–35, Feb. 1988

work page 1988

[41] [41]

What is local optimality in nonconvex-nonconcave minimax optimization?

C. Jin, P. Netrapalli, and M. Jordan, “What is local optimality in nonconvex-nonconcave minimax optimization?” in Proceedings of the 37th International Conference on Machine Learning . PMLR, Nov. 2020, pp. 4880–4889

work page 2020

[42] [42]

The complexity of constrained min-max optimization,

C. Daskalakis, S. Skoulakis, and M. Zampetakis, “The complexity of constrained min-max optimization,” in Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing , ser. STOC 2021. New York, NY , USA: Association for Computing Machinery, Jun. 2021, pp. 1466–1478

work page 2021

[43] [43]

Limiting behaviors of nonconvex-nonconcave minimax optimization via continuous-time systems,

B. Grimmer, H. Lu, P. Worah, and V . Mirrokni, “Limiting behaviors of nonconvex-nonconcave minimax optimization via continuous-time systems,” in Proceedings of The 33rd International Conference on Algorithmic Learning Theory . PMLR, Mar. 2022, pp. 465–487

work page 2022

[44] [44]

Implicit learning dynamics in stack- elberg games: Equilibria characterization, convergence analysis, and empirical study,

T. Fiez, B. Chasnov, and L. Ratliff, “Implicit learning dynamics in stack- elberg games: Equilibria characterization, convergence analysis, and empirical study,” in Proceedings of the 37th International Conference on Machine Learning . PMLR, Nov. 2020, pp. 3133–3144. APPENDIX A DISCUSSION ON CANONICALIZATION If we align pξk with pwk and optimize over pwk,...

work page 2020

[45] [45]

= 1, Ew∼pwk [w] = µk γ3,k ¯Σk ⪯ Ew∼pwk h (w−¯µk) (w−¯µk)T i ⪯ γ2,k ¯Σk ¯Σk (µk − ¯µk) (µk − ¯µk)⊤ γ1,k ⪰ 0 pwk(w) ≥ 0 ∀w ∈ Rdx . (24) The objective is minimized when E h (wk −¯µk) (wk −¯µk)T i equals γ2,k ¯Σk, which means the solution for p∗ wk is the set n pwk | Ew∼pwk h (w−¯µk) (w−¯µk)T i = γ2,k ¯Σk o , and the objective value at (ˆp∗ k, Φ∗ k) is −1 2 d...

work page 2013