pith. sign in

arxiv: 2605.03845 · v1 · submitted 2026-05-05 · 📡 eess.SY · cs.SY

Sinkhorn Ambiguity Sets for Distributionally Robust Control: Convexity, Weak Compactness, and Tractability

Pith reviewed 2026-05-07 13:50 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords distributionally robust controlSinkhorn discrepancyambiguity setslinear quadratic controlconvex programmingsafety constraintstrajectory planning
0
0 comments X

The pith

Sinkhorn ambiguity sets turn the distributionally robust linear quadratic control problem over linear policies into a convex program, even with safety constraints.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Classical stochastic control assumes complete knowledge of uncertainty, but real systems often have only partial information from limited samples. The paper uses Sinkhorn discrepancy to define ambiguity sets that blend observed data with a reference distribution and avoid forcing worst-case distributions to be discrete. Under standard assumptions, these sets are shown to be convex and weakly compact. This property is leveraged to reformulate the distributionally robust linear quadratic control problem over linear policies as a convex optimization task that remains tractable when distributionally robust safety constraints are added.

Core claim

We establish convexity and weak compactness of Sinkhorn ambiguity sets under standard assumptions. We then leverage these results to prove that the Sinkhorn DR linear quadratic control problem over linear policies can be solved through convex programming, even in the presence of DR safety constraints. The findings are validated on a trajectory planning example.

What carries the argument

Sinkhorn ambiguity sets defined via the Sinkhorn discrepancy, which model uncertainty while preserving convexity for control optimization.

If this is right

  • The Sinkhorn DR linear quadratic control problem over linear policies admits a convex programming formulation.
  • This convex formulation continues to hold when distributionally robust safety constraints are included.
  • The approach remains effective when only a small number of noise samples are available by incorporating a reference distribution.
  • Worst-case distributions need not be restricted to discrete supports, unlike some other optimal transport distances.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar convexity arguments might apply to other control problems or policy structures if the underlying ambiguity sets preserve the same properties.
  • The method could support more realistic robustness analysis in systems where uncertainty is continuous rather than discrete.
  • Blending data and priors through the reference distribution may help in online or adaptive control settings with streaming observations.

Load-bearing premise

Standard assumptions hold under which Sinkhorn ambiguity sets remain convex and weakly compact.

What would settle it

A concrete linear-quadratic instance with linear policies where the Sinkhorn DR problem, including safety constraints, cannot be cast as a convex program despite the stated assumptions on the ambiguity sets.

Figures

Figures reproduced from arXiv: 2605.03845 by Andrea Martin, Giancarlo Ferrari-Trecate, Riccardo Cescon.

Figure 1
Figure 1. Figure 1: Visualization of worst-case distributions from Wasserstein DRO (top view at source ↗
Figure 2
Figure 2. Figure 2: CVaR is the expected value of the γ-right tail (dark blue). constrained control problem can then be expressed as π ⋆ ∈ arg min π=(π0,...,πT−1) EP[J(π, w)] (8a) subject to CVaRP γ (max{h(x), g(u)}) ≤ 0 . (8b) B. Approximations and problem formulation Problem (8) is rarely solvable directly. Indeed, most of the time, the true underlying probability P of w is unknown. Instead, we assume to have access to N ∈ … view at source ↗
Figure 3
Figure 3. Figure 3: Comparison in cost (left) and violation (right) percentage increase of view at source ↗
Figure 4
Figure 4. Figure 4: Terminal state of different trajectories given by the policies view at source ↗
read the original abstract

Classical stochastic control assumes perfect knowledge of the uncertainty affecting the plant. In practice, however, such information is often incomplete. To address this limitation, we consider a distributionally robust control (DRC) problem with ambiguity sets defined via the Sinkhorn discrepancy. Compared to other discrepancy measures based on optimal transport, such as the popular Wasserstein distance, the Sinkhorn divergence does not constrain the worst-case distribution to be discrete, and allows combining observed data with prior knowledge in the form of a reference distribution, making this choice particularly suitable when only few noise samples are available for control design. We first study the properties of Sinkhorn ambiguity sets, establishing convexity and weak compactness under standard assumptions. We then leverage these results to prove that, the Sinkhorn DR linear quadratic control problem over linear policies can be solved through convex programming-even in the presence of DR safety constraints. Finally, we validate our theoretical findings and demonstrate the effectiveness of the proposed approach on a trajectory planning example.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Sinkhorn ambiguity sets for distributionally robust control (DRC). It establishes convexity and weak compactness of these sets under standard assumptions, then proves that the Sinkhorn DR linear quadratic control problem over linear policies (including with DR safety constraints) reduces to convex programming. The approach is illustrated via a trajectory planning example.

Significance. If the convexity, weak compactness, and resulting tractability hold, the work provides a data-efficient DRC framework that supports continuous worst-case distributions and prior incorporation, offering advantages over Wasserstein ambiguity sets in low-sample regimes. The explicit convex reformulation for LQ problems with safety constraints could enable practical robust controller synthesis.

major comments (2)
  1. [Section 3] Section 3 (properties of Sinkhorn ambiguity sets): The claims of convexity and weak compactness 'under standard assumptions' are load-bearing for the minimax interchange and convex reformulation in Section 4. The manuscript invokes these properties for the reference measure P (empirical + prior mixture) on a Polish space but does not spell out the precise conditions (e.g., moment bounds, support restrictions, or topology) that ensure the sets remain convex and weakly compact when the cost is quadratic and the constraints are distributionally robust chance constraints. Without this, it is unclear whether compactness holds in the weak topology relevant to the LQ setting.
  2. [Section 4] Section 4 (DR LQ control formulation): The reduction to convex programming relies on the ambiguity-set properties from Section 3 to justify interchanging min and max. If the weak compactness fails for the specific quadratic cost and chance-constraint structure, the convex reformulation (and its tractability) no longer follows directly; the paper should either derive the required conditions explicitly or provide a self-contained verification for the LQ case.
minor comments (2)
  1. [Abstract] Abstract: The numerical validation on the trajectory planning example is mentioned without reference to specific baselines, quantitative metrics, or error analysis, which would help readers assess practical gains.
  2. [Section 3] Notation: The definition of the Sinkhorn divergence and the precise form of the ambiguity set {Q : Sinkhorn(Q,P) ≤ ε} should be restated explicitly at the start of Section 3 for self-contained reading.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on the properties of Sinkhorn ambiguity sets and their role in the distributionally robust LQ control formulation. We agree that the assumptions underlying convexity and weak compactness require more explicit statement to support the minimax interchange and convex reformulation, particularly for quadratic costs and distributionally robust chance constraints. We respond to each major comment below and will incorporate clarifications in the revised manuscript.

read point-by-point responses
  1. Referee: [Section 3] Section 3 (properties of Sinkhorn ambiguity sets): The claims of convexity and weak compactness 'under standard assumptions' are load-bearing for the minimax interchange and convex reformulation in Section 4. The manuscript invokes these properties for the reference measure P (empirical + prior mixture) on a Polish space but does not spell out the precise conditions (e.g., moment bounds, support restrictions, or topology) that ensure the sets remain convex and weakly compact when the cost is quadratic and the constraints are distributionally robust chance constraints. Without this, it is unclear whether compactness holds in the weak topology relevant to the LQ setting.

    Authors: We thank the referee for this observation. Convexity of the Sinkhorn ambiguity sets follows immediately from the joint convexity of the Sinkhorn divergence in its two arguments, which holds on any Polish space without further restrictions. Weak compactness, however, does require additional structure on the reference measure. In the revision we will replace the phrase 'under standard assumptions' with an explicit list: the underlying space is Polish and metrized by a distance inducing the weak topology; the reference measure P (empirical-plus-prior mixture) possesses finite second moments; and the Sinkhorn radius is chosen so that the ambiguity set is tight (invoking Prokhorov's theorem). We will also add a short remark verifying that these conditions are compatible with the quadratic stage cost and with the distributionally robust chance constraints, which remain upper semicontinuous under weak convergence when second moments are controlled. revision: yes

  2. Referee: [Section 4] Section 4 (DR LQ control formulation): The reduction to convex programming relies on the ambiguity-set properties from Section 3 to justify interchanging min and max. If the weak compactness fails for the specific quadratic cost and chance-constraint structure, the convex reformulation (and its tractability) no longer follows directly; the paper should either derive the required conditions explicitly or provide a self-contained verification for the LQ case.

    Authors: We agree that the justification for the minimax interchange should be self-contained for the linear-quadratic setting. The revised manuscript will contain a dedicated lemma in Section 4 that (i) confirms weak compactness of the Sinkhorn ambiguity set under the quadratic cost (using the finite-second-moment condition stated in the updated Section 3) and (ii) shows that the distributionally robust chance constraints are closed in the weak topology. With these two facts, the standard minimax theorem for convex-concave problems on compact sets applies directly, yielding the convex program without additional assumptions. We will also include a brief proof sketch that the linear policy parameterization preserves convexity of the resulting finite-dimensional program. revision: yes

Circularity Check

0 steps flagged

No circularity: properties established independently before use in control result

full rationale

The manuscript states it first studies and establishes convexity and weak compactness of the Sinkhorn ambiguity sets under standard assumptions, then leverages those results to obtain the convex-programming reformulation of the DR LQ control problem (including safety constraints). This sequence is self-contained within the paper rather than reducing any claim to a fitted parameter, self-definition, or unverified self-citation. The central tractability result follows from the newly established set properties without the prediction equaling its inputs by construction. No load-bearing step matches the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on unstated standard assumptions for Sinkhorn divergence properties and control-theoretic regularity conditions; no free parameters or invented entities are introduced in the abstract.

axioms (1)
  • domain assumption standard assumptions under which Sinkhorn ambiguity sets are convex and weakly compact
    Invoked to establish the main properties and tractability result.

pith-pipeline@v0.9.0 · 5477 in / 1012 out tokens · 40434 ms · 2026-05-07T13:50:31.391130+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages

  1. [1]

    Bertsekas,Dynamic programming and optimal control: Volume I

    D. Bertsekas,Dynamic programming and optimal control: Volume I. Athena scientific, 2012, vol. 4

  2. [2]

    Zhou and J

    K. Zhou and J. Doyle,Essentials of robust control. Upper Saddle River: Prentice-Hall, 1998

  3. [3]

    LQG control with anH ∞ performance bound: A Riccati equation approach,

    D. S. Bernstein and W. M. Haddad, “LQG control with anH ∞ performance bound: A Riccati equation approach,” in1988 American Control Conference. IEEE, 1988, pp. 796–802

  4. [4]

    Optimal control with mixed H2 andH ∞ performance objectives,

    J. Doyle, K. Zhou, and B. Bodenheimer, “Optimal control with mixed H2 andH ∞ performance objectives,” in1989 American Control Conference. IEEE, 1989, pp. 2065–2070

  5. [5]

    On the guarantees of minimizing regret in receding horizon,

    A. Martin, L. Furieri, F. D ¨orfler, J. Lygeros, and G. Ferrari-Trecate, “On the guarantees of minimizing regret in receding horizon,”IEEE Transactions on Automatic Control, vol. 70, no. 3, pp. 1547–1562, 2025

  6. [6]

    Regret-optimal estimation and control,

    G. Goel and B. Hassibi, “Regret-optimal estimation and control,”IEEE Transactions on Automatic Control, vol. 68, no. 5, pp. 3041–3053, 2023

  7. [7]

    Regret optimal control for uncertain stochastic systems,

    A. Martin, L. Furieri, F. D ¨orfler, J. Lygeros, and G. Ferrari-Trecate, “Regret optimal control for uncertain stochastic systems,”European Journal of Control, vol. 80, p. 101051, 2024

  8. [8]

    Distributionally robust control of constrained stochastic systems,

    B. P. Van Parys, D. Kuhn, P. J. Goulart, and M. Morari, “Distributionally robust control of constrained stochastic systems,”IEEE Transactions on Automatic Control, vol. 61, no. 2, pp. 430–442, 2015

  9. [9]

    Distributionally robust LQG control under distributed uncertainty,

    L. Falconi, A. Ferrante, and M. Zorzi, “Distributionally robust LQG control under distributed uncertainty,”Automatica, vol. 174, p. 112128, 2025

  10. [10]

    Minimax optimal control of stochastic uncertain systems with relative entropy constraints,

    I. R. Petersen, M. R. James, and P. Dupuis, “Minimax optimal control of stochastic uncertain systems with relative entropy constraints,”IEEE Transactions on Automatic Control, vol. 45, no. 3, pp. 398–412, 2000

  11. [11]

    Distributionally robust LQG with Kullback-Leibler ambiguity sets,

    M. Fochesato, L. Falconi, M. Zorzi, A. Ferrante, and J. Lygeros, “Distributionally robust LQG with Kullback-Leibler ambiguity sets,” arXiv preprint arXiv:2505.08370, 2025

  12. [12]

    Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations,

    P. Mohajerin Esfahani and D. Kuhn, “Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations,”Mathematical Programming, vol. 171, no. 1, pp. 115–166, 2018

  13. [13]

    Distributionally robust linear quadratic control,

    B. Taskesen, D. Iancu, C ¸ . Koc ¸yi ˘git, and D. Kuhn, “Distributionally robust linear quadratic control,”Advances in Neural Information Pro- cessing Systems, vol. 36, 2024

  14. [14]

    Optimality of linear policies for distributionally robust linear quadratic gaussian regulator with stationary distributions,

    N. Lanzetti, A. Terpin, and F. D ¨orfler, “Optimality of linear policies for distributionally robust linear quadratic gaussian regulator with stationary distributions,”arXiv preprint arXiv:2410.22826, 2024

  15. [15]

    Wasserstein tube MPC with exact uncertainty propagation,

    L. Aolaritei, M. Fochesato, J. Lygeros, and F. D ¨orfler, “Wasserstein tube MPC with exact uncertainty propagation,” in2023 62nd IEEE Conference on Decision and Control (CDC). IEEE, 2023, pp. 2036– 2041

  16. [16]

    Distributionally robust chance constrained data-enabled predictive control,

    J. Coulson, J. Lygeros, and F. D ¨orfler, “Distributionally robust chance constrained data-enabled predictive control,”IEEE Transactions on Automatic Control, vol. 67, no. 7, pp. 3289–3304, 2021

  17. [17]

    Data-driven distributionally robust MPC for systems with uncertain dynamics,

    F. Micheli, T. Summers, and J. Lygeros, “Data-driven distributionally robust MPC for systems with uncertain dynamics,” in2022 IEEE 61st Conference on Decision and Control (CDC). IEEE, 2022, pp. 4788– 4793

  18. [18]

    Distributionally robust infinite-horizon control: from a pool of samples to the design of dependable controllers,

    J.-S. Brouillon, A. Martin, J. Lygeros, F. D ¨orfler, and G. Ferrari- Trecate, “Distributionally robust infinite-horizon control: from a pool of samples to the design of dependable controllers,”IEEE Transactions on Automatic Control, vol. 70, no. 10, pp. 6465–6480, 2025

  19. [19]

    Distributionally robust stochastic optimiza- tion with Wasserstein distance,

    R. Gao and A. Kleywegt, “Distributionally robust stochastic optimiza- tion with Wasserstein distance,”Mathematics of Operations Research, vol. 48, no. 2, pp. 603–655, 2023

  20. [20]

    Sinkhorn distances: Lightspeed computation of optimal transport,

    M. Cuturi, “Sinkhorn distances: Lightspeed computation of optimal transport,”Advances in neural information processing systems, vol. 26, 2013

  21. [21]

    Regularization for Wasserstein distributionally robust optimization,

    W. Azizian, F. Iutzeler, and J. Malick, “Regularization for Wasserstein distributionally robust optimization,”ESAIM: Control, Optimisation and Calculus of Variations, vol. 29, p. 33, 2023

  22. [22]

    Unifying distribution- ally robust optimization via optimal transport theory,

    J. Blanchet, D. Kuhn, J. Li, and B. Taskesen, “Unifying distribution- ally robust optimization via optimal transport theory,”arXiv preprint arXiv:2308.05414, 2023

  23. [23]

    Entropy-regularized Wasserstein distributionally robust shape and topology optimization,

    C. Dapogny, F. Iutzeler, A. Meda, and B. Thibert, “Entropy-regularized Wasserstein distributionally robust shape and topology optimization,” Structural and Multidisciplinary Optimization, vol. 66, no. 3, p. 42, 2023

  24. [24]

    Data-driven Distribution- ally Robust Control Based on Sinkhorn Ambiguity Sets,

    R. Cescon, A. Martin, and G. Ferrari-Trecate, “Data-driven Distribution- ally Robust Control Based on Sinkhorn Ambiguity Sets,” in2025 IEEE 64th Conference on Decision and Control (CDC). IEEE, 2025, pp. 4708–4713

  25. [25]

    Sinkhorn distributionally robust state estimation via system level synthesis,

    Y . Feng, X. Li, S. X. Ding, H. Ye, and C. Shang, “Sinkhorn Distri- butionally Robust State Estimation via System Level Synthesis,”arXiv preprint arXiv:2602.08018, 2026

  26. [26]

    Distributionally robust optimization,

    D. Kuhn, S. Shafiee, and W. Wiesemann, “Distributionally robust optimization,”Acta Numerica, vol. 34, pp. 579–804, 2025

  27. [27]

    Sinkhorn distributionally robust optimiza- tion,

    J. Wang, R. Gao, and Y . Xie, “Sinkhorn distributionally robust optimiza- tion,”Operations Research, 2025

  28. [28]

    Villaniet al.,Optimal transport: old and new

    C. Villaniet al.,Optimal transport: old and new. Springer, 2009, vol. 338

  29. [29]

    Kallenberg,Foundations of modern probability, 2nd ed

    O. Kallenberg,Foundations of modern probability, 2nd ed. Springer- Verlag, New York, 2002

  30. [30]

    Wasserstein distributionally robust optimization: Theory and appli- cations in machine learning,

    D. Kuhn, P. M. Esfahani, V . A. Nguyen, and S. Shafieezadeh-Abadeh, “Wasserstein distributionally robust optimization: Theory and appli- cations in machine learning,” inOperations research & management science in the age of analytics. Informs, 2019, pp. 130–166

  31. [31]

    New perspectives on regularization and computation in optimal transport-based distributionally robust optimization,

    S. Shafieezadeh-Abadeh, L. Aolaritei, F. D ¨orfler, and D. Kuhn, “New perspectives on regularization and computation in optimal transport-based distributionally robust optimization,”arXiv preprint arXiv:2303.03900, 2023

  32. [32]

    On linear optimization over Wasserstein balls,

    M.-C. Yue, D. Kuhn, and W. Wiesemann, “On linear optimization over Wasserstein balls,”Mathematical Programming, vol. 195, no. 1, pp. 1107–1122, 2022

  33. [33]

    Optimization of conditional value- at-risk,

    R. T. Rockafellar and S. Uryasev, “Optimization of conditional value- at-risk,”Journal of risk, vol. 2, pp. 21–42, 2000

  34. [34]

    On the global optimality of linear policies for Sinkhorn distributionally robust linear quadratic control,

    R. Cescon, A. Martin, and G. Ferrari-Trecate, “On the global optimality of linear policies for Sinkhorn distributionally robust linear quadratic control,”arXiv preprint arXiv:2509.00956, 2025

  35. [35]

    A design of discrete-time integral controllers with computation delays via loop transfer recovery,

    T. Ishihara, H.-J. Guo, and H. Takeda, “A design of discrete-time integral controllers with computation delays via loop transfer recovery,” Automatica, vol. 28, no. 3, pp. 599–603, 1992. [36]U.S. Military Handbook, U.S. Department of Defense, 1997

  36. [36]

    Creating a unified graphical wind turbulence model from multiple specifications,

    S. Gage, “Creating a unified graphical wind turbulence model from multiple specifications,” inAIAA Modeling and simulation technologies conference and exhibit, 2003, p. 5529

  37. [37]

    R. T. Rockafellar and R. J.-B. Wets,Variational analysis. Springer Science & Business Media, 2009, vol. 317

  38. [38]

    Billingsley,Convergence of probability measures

    P. Billingsley,Convergence of probability measures. John Wiley & Sons, 2013

  39. [39]

    Munkres,Topology, ser

    J. Munkres,Topology, ser. Featured Titles for Topology. Prentice Hall, Incorporated, 2000

  40. [40]

    Entropy-regularized 2- Wasserstein distance between Gaussian measures,

    A. Mallasto, A. Gerolin, and H. Q. Minh, “Entropy-regularized 2- Wasserstein distance between Gaussian measures,”Information Geom- etry, vol. 5, no. 1, pp. 289–323, 2022

  41. [41]

    Lecture notes on information theory,

    Y . Polyanskiy and Y . Wu, “Lecture notes on information theory,”Lecture Notes for ECE563 (UIUC) and, vol. 6, no. 2012-2016, p. 7, 2014

  42. [42]

    Wasserstein metric and subordination,

    P. Cl ´ement and W. Desch, “Wasserstein metric and subordination,” Studia Mathematica, vol. 1, no. 189, pp. 35–52, 2008

  43. [43]

    A system-level approach to controller synthesis,

    Y .-S. Wang, N. Matni, and J. C. Doyle, “A system-level approach to controller synthesis,”IEEE Transactions on Automatic Control, vol. 64, no. 10, pp. 4079–4093, 2019

  44. [44]

    On general minimax theorems,

    M. Sion, “On general minimax theorems,”Pacific Journal of Mathemat- ics, vol. 8, no. 1, pp. 171–176, 1958

  45. [45]

    escaping

    S. Boyd and L. Vandenberghe,Convex optimization. Cambridge university press, 2004. APPENDIXI AUXILIARYRESULTS We review some well-known facts from measure theory and variational analysis that are used to prove our results. Later in the section we also present a brief overview of the topology of the Wasserstein space. Definition 6(Lower limit and lower sem...

  46. [46]

    The affine subspace defined by I−ZA−ZB Φxx Φxy Φux Φuy = I0 ,(23a) Φxx Φxy Φux Φuy I−ZA −C = I 0 ,(23b) I−ZA−ZB ϕx ϕu = 0,(23c) parameterizes all possible system responses(20)

  47. [47]

    Proof.Proof of part 1

    For any vectors{ϕ x,ϕ u}and block lower-triangular ma- trices{Φ xx,Φ xy,Φ ux,Φ uy}satisfying(23), the affine controller in(10)withK=Φ uy −Φ uxΦ−1 xx Φxy and v=ϕ u −Φ uyCϕx +Φ uxΦ−1 xx ΦxyCϕx achieves the desired response. Proof.Proof of part 1. LetKbe any block lower-triangular operator andvany real vector. We verify (23a) by using the definitions ofΦ xx ...