Sinkhorn Ambiguity Sets for Distributionally Robust Control: Convexity, Weak Compactness, and Tractability
Pith reviewed 2026-05-07 13:50 UTC · model grok-4.3
The pith
Sinkhorn ambiguity sets turn the distributionally robust linear quadratic control problem over linear policies into a convex program, even with safety constraints.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We establish convexity and weak compactness of Sinkhorn ambiguity sets under standard assumptions. We then leverage these results to prove that the Sinkhorn DR linear quadratic control problem over linear policies can be solved through convex programming, even in the presence of DR safety constraints. The findings are validated on a trajectory planning example.
What carries the argument
Sinkhorn ambiguity sets defined via the Sinkhorn discrepancy, which model uncertainty while preserving convexity for control optimization.
If this is right
- The Sinkhorn DR linear quadratic control problem over linear policies admits a convex programming formulation.
- This convex formulation continues to hold when distributionally robust safety constraints are included.
- The approach remains effective when only a small number of noise samples are available by incorporating a reference distribution.
- Worst-case distributions need not be restricted to discrete supports, unlike some other optimal transport distances.
Where Pith is reading between the lines
- Similar convexity arguments might apply to other control problems or policy structures if the underlying ambiguity sets preserve the same properties.
- The method could support more realistic robustness analysis in systems where uncertainty is continuous rather than discrete.
- Blending data and priors through the reference distribution may help in online or adaptive control settings with streaming observations.
Load-bearing premise
Standard assumptions hold under which Sinkhorn ambiguity sets remain convex and weakly compact.
What would settle it
A concrete linear-quadratic instance with linear policies where the Sinkhorn DR problem, including safety constraints, cannot be cast as a convex program despite the stated assumptions on the ambiguity sets.
Figures
read the original abstract
Classical stochastic control assumes perfect knowledge of the uncertainty affecting the plant. In practice, however, such information is often incomplete. To address this limitation, we consider a distributionally robust control (DRC) problem with ambiguity sets defined via the Sinkhorn discrepancy. Compared to other discrepancy measures based on optimal transport, such as the popular Wasserstein distance, the Sinkhorn divergence does not constrain the worst-case distribution to be discrete, and allows combining observed data with prior knowledge in the form of a reference distribution, making this choice particularly suitable when only few noise samples are available for control design. We first study the properties of Sinkhorn ambiguity sets, establishing convexity and weak compactness under standard assumptions. We then leverage these results to prove that, the Sinkhorn DR linear quadratic control problem over linear policies can be solved through convex programming-even in the presence of DR safety constraints. Finally, we validate our theoretical findings and demonstrate the effectiveness of the proposed approach on a trajectory planning example.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces Sinkhorn ambiguity sets for distributionally robust control (DRC). It establishes convexity and weak compactness of these sets under standard assumptions, then proves that the Sinkhorn DR linear quadratic control problem over linear policies (including with DR safety constraints) reduces to convex programming. The approach is illustrated via a trajectory planning example.
Significance. If the convexity, weak compactness, and resulting tractability hold, the work provides a data-efficient DRC framework that supports continuous worst-case distributions and prior incorporation, offering advantages over Wasserstein ambiguity sets in low-sample regimes. The explicit convex reformulation for LQ problems with safety constraints could enable practical robust controller synthesis.
major comments (2)
- [Section 3] Section 3 (properties of Sinkhorn ambiguity sets): The claims of convexity and weak compactness 'under standard assumptions' are load-bearing for the minimax interchange and convex reformulation in Section 4. The manuscript invokes these properties for the reference measure P (empirical + prior mixture) on a Polish space but does not spell out the precise conditions (e.g., moment bounds, support restrictions, or topology) that ensure the sets remain convex and weakly compact when the cost is quadratic and the constraints are distributionally robust chance constraints. Without this, it is unclear whether compactness holds in the weak topology relevant to the LQ setting.
- [Section 4] Section 4 (DR LQ control formulation): The reduction to convex programming relies on the ambiguity-set properties from Section 3 to justify interchanging min and max. If the weak compactness fails for the specific quadratic cost and chance-constraint structure, the convex reformulation (and its tractability) no longer follows directly; the paper should either derive the required conditions explicitly or provide a self-contained verification for the LQ case.
minor comments (2)
- [Abstract] Abstract: The numerical validation on the trajectory planning example is mentioned without reference to specific baselines, quantitative metrics, or error analysis, which would help readers assess practical gains.
- [Section 3] Notation: The definition of the Sinkhorn divergence and the precise form of the ambiguity set {Q : Sinkhorn(Q,P) ≤ ε} should be restated explicitly at the start of Section 3 for self-contained reading.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on the properties of Sinkhorn ambiguity sets and their role in the distributionally robust LQ control formulation. We agree that the assumptions underlying convexity and weak compactness require more explicit statement to support the minimax interchange and convex reformulation, particularly for quadratic costs and distributionally robust chance constraints. We respond to each major comment below and will incorporate clarifications in the revised manuscript.
read point-by-point responses
-
Referee: [Section 3] Section 3 (properties of Sinkhorn ambiguity sets): The claims of convexity and weak compactness 'under standard assumptions' are load-bearing for the minimax interchange and convex reformulation in Section 4. The manuscript invokes these properties for the reference measure P (empirical + prior mixture) on a Polish space but does not spell out the precise conditions (e.g., moment bounds, support restrictions, or topology) that ensure the sets remain convex and weakly compact when the cost is quadratic and the constraints are distributionally robust chance constraints. Without this, it is unclear whether compactness holds in the weak topology relevant to the LQ setting.
Authors: We thank the referee for this observation. Convexity of the Sinkhorn ambiguity sets follows immediately from the joint convexity of the Sinkhorn divergence in its two arguments, which holds on any Polish space without further restrictions. Weak compactness, however, does require additional structure on the reference measure. In the revision we will replace the phrase 'under standard assumptions' with an explicit list: the underlying space is Polish and metrized by a distance inducing the weak topology; the reference measure P (empirical-plus-prior mixture) possesses finite second moments; and the Sinkhorn radius is chosen so that the ambiguity set is tight (invoking Prokhorov's theorem). We will also add a short remark verifying that these conditions are compatible with the quadratic stage cost and with the distributionally robust chance constraints, which remain upper semicontinuous under weak convergence when second moments are controlled. revision: yes
-
Referee: [Section 4] Section 4 (DR LQ control formulation): The reduction to convex programming relies on the ambiguity-set properties from Section 3 to justify interchanging min and max. If the weak compactness fails for the specific quadratic cost and chance-constraint structure, the convex reformulation (and its tractability) no longer follows directly; the paper should either derive the required conditions explicitly or provide a self-contained verification for the LQ case.
Authors: We agree that the justification for the minimax interchange should be self-contained for the linear-quadratic setting. The revised manuscript will contain a dedicated lemma in Section 4 that (i) confirms weak compactness of the Sinkhorn ambiguity set under the quadratic cost (using the finite-second-moment condition stated in the updated Section 3) and (ii) shows that the distributionally robust chance constraints are closed in the weak topology. With these two facts, the standard minimax theorem for convex-concave problems on compact sets applies directly, yielding the convex program without additional assumptions. We will also include a brief proof sketch that the linear policy parameterization preserves convexity of the resulting finite-dimensional program. revision: yes
Circularity Check
No circularity: properties established independently before use in control result
full rationale
The manuscript states it first studies and establishes convexity and weak compactness of the Sinkhorn ambiguity sets under standard assumptions, then leverages those results to obtain the convex-programming reformulation of the DR LQ control problem (including safety constraints). This sequence is self-contained within the paper rather than reducing any claim to a fitted parameter, self-definition, or unverified self-citation. The central tractability result follows from the newly established set properties without the prediction equaling its inputs by construction. No load-bearing step matches the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption standard assumptions under which Sinkhorn ambiguity sets are convex and weakly compact
Reference graph
Works this paper leans on
-
[1]
Bertsekas,Dynamic programming and optimal control: Volume I
D. Bertsekas,Dynamic programming and optimal control: Volume I. Athena scientific, 2012, vol. 4
work page 2012
-
[2]
K. Zhou and J. Doyle,Essentials of robust control. Upper Saddle River: Prentice-Hall, 1998
work page 1998
-
[3]
LQG control with anH ∞ performance bound: A Riccati equation approach,
D. S. Bernstein and W. M. Haddad, “LQG control with anH ∞ performance bound: A Riccati equation approach,” in1988 American Control Conference. IEEE, 1988, pp. 796–802
work page 1988
-
[4]
Optimal control with mixed H2 andH ∞ performance objectives,
J. Doyle, K. Zhou, and B. Bodenheimer, “Optimal control with mixed H2 andH ∞ performance objectives,” in1989 American Control Conference. IEEE, 1989, pp. 2065–2070
work page 1989
-
[5]
On the guarantees of minimizing regret in receding horizon,
A. Martin, L. Furieri, F. D ¨orfler, J. Lygeros, and G. Ferrari-Trecate, “On the guarantees of minimizing regret in receding horizon,”IEEE Transactions on Automatic Control, vol. 70, no. 3, pp. 1547–1562, 2025
work page 2025
-
[6]
Regret-optimal estimation and control,
G. Goel and B. Hassibi, “Regret-optimal estimation and control,”IEEE Transactions on Automatic Control, vol. 68, no. 5, pp. 3041–3053, 2023
work page 2023
-
[7]
Regret optimal control for uncertain stochastic systems,
A. Martin, L. Furieri, F. D ¨orfler, J. Lygeros, and G. Ferrari-Trecate, “Regret optimal control for uncertain stochastic systems,”European Journal of Control, vol. 80, p. 101051, 2024
work page 2024
-
[8]
Distributionally robust control of constrained stochastic systems,
B. P. Van Parys, D. Kuhn, P. J. Goulart, and M. Morari, “Distributionally robust control of constrained stochastic systems,”IEEE Transactions on Automatic Control, vol. 61, no. 2, pp. 430–442, 2015
work page 2015
-
[9]
Distributionally robust LQG control under distributed uncertainty,
L. Falconi, A. Ferrante, and M. Zorzi, “Distributionally robust LQG control under distributed uncertainty,”Automatica, vol. 174, p. 112128, 2025
work page 2025
-
[10]
Minimax optimal control of stochastic uncertain systems with relative entropy constraints,
I. R. Petersen, M. R. James, and P. Dupuis, “Minimax optimal control of stochastic uncertain systems with relative entropy constraints,”IEEE Transactions on Automatic Control, vol. 45, no. 3, pp. 398–412, 2000
work page 2000
-
[11]
Distributionally robust LQG with Kullback-Leibler ambiguity sets,
M. Fochesato, L. Falconi, M. Zorzi, A. Ferrante, and J. Lygeros, “Distributionally robust LQG with Kullback-Leibler ambiguity sets,” arXiv preprint arXiv:2505.08370, 2025
-
[12]
P. Mohajerin Esfahani and D. Kuhn, “Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations,”Mathematical Programming, vol. 171, no. 1, pp. 115–166, 2018
work page 2018
-
[13]
Distributionally robust linear quadratic control,
B. Taskesen, D. Iancu, C ¸ . Koc ¸yi ˘git, and D. Kuhn, “Distributionally robust linear quadratic control,”Advances in Neural Information Pro- cessing Systems, vol. 36, 2024
work page 2024
-
[14]
N. Lanzetti, A. Terpin, and F. D ¨orfler, “Optimality of linear policies for distributionally robust linear quadratic gaussian regulator with stationary distributions,”arXiv preprint arXiv:2410.22826, 2024
-
[15]
Wasserstein tube MPC with exact uncertainty propagation,
L. Aolaritei, M. Fochesato, J. Lygeros, and F. D ¨orfler, “Wasserstein tube MPC with exact uncertainty propagation,” in2023 62nd IEEE Conference on Decision and Control (CDC). IEEE, 2023, pp. 2036– 2041
work page 2023
-
[16]
Distributionally robust chance constrained data-enabled predictive control,
J. Coulson, J. Lygeros, and F. D ¨orfler, “Distributionally robust chance constrained data-enabled predictive control,”IEEE Transactions on Automatic Control, vol. 67, no. 7, pp. 3289–3304, 2021
work page 2021
-
[17]
Data-driven distributionally robust MPC for systems with uncertain dynamics,
F. Micheli, T. Summers, and J. Lygeros, “Data-driven distributionally robust MPC for systems with uncertain dynamics,” in2022 IEEE 61st Conference on Decision and Control (CDC). IEEE, 2022, pp. 4788– 4793
work page 2022
-
[18]
J.-S. Brouillon, A. Martin, J. Lygeros, F. D ¨orfler, and G. Ferrari- Trecate, “Distributionally robust infinite-horizon control: from a pool of samples to the design of dependable controllers,”IEEE Transactions on Automatic Control, vol. 70, no. 10, pp. 6465–6480, 2025
work page 2025
-
[19]
Distributionally robust stochastic optimiza- tion with Wasserstein distance,
R. Gao and A. Kleywegt, “Distributionally robust stochastic optimiza- tion with Wasserstein distance,”Mathematics of Operations Research, vol. 48, no. 2, pp. 603–655, 2023
work page 2023
-
[20]
Sinkhorn distances: Lightspeed computation of optimal transport,
M. Cuturi, “Sinkhorn distances: Lightspeed computation of optimal transport,”Advances in neural information processing systems, vol. 26, 2013
work page 2013
-
[21]
Regularization for Wasserstein distributionally robust optimization,
W. Azizian, F. Iutzeler, and J. Malick, “Regularization for Wasserstein distributionally robust optimization,”ESAIM: Control, Optimisation and Calculus of Variations, vol. 29, p. 33, 2023
work page 2023
-
[22]
Unifying distribution- ally robust optimization via optimal transport theory,
J. Blanchet, D. Kuhn, J. Li, and B. Taskesen, “Unifying distribution- ally robust optimization via optimal transport theory,”arXiv preprint arXiv:2308.05414, 2023
-
[23]
Entropy-regularized Wasserstein distributionally robust shape and topology optimization,
C. Dapogny, F. Iutzeler, A. Meda, and B. Thibert, “Entropy-regularized Wasserstein distributionally robust shape and topology optimization,” Structural and Multidisciplinary Optimization, vol. 66, no. 3, p. 42, 2023
work page 2023
-
[24]
Data-driven Distribution- ally Robust Control Based on Sinkhorn Ambiguity Sets,
R. Cescon, A. Martin, and G. Ferrari-Trecate, “Data-driven Distribution- ally Robust Control Based on Sinkhorn Ambiguity Sets,” in2025 IEEE 64th Conference on Decision and Control (CDC). IEEE, 2025, pp. 4708–4713
work page 2025
-
[25]
Sinkhorn distributionally robust state estimation via system level synthesis,
Y . Feng, X. Li, S. X. Ding, H. Ye, and C. Shang, “Sinkhorn Distri- butionally Robust State Estimation via System Level Synthesis,”arXiv preprint arXiv:2602.08018, 2026
-
[26]
Distributionally robust optimization,
D. Kuhn, S. Shafiee, and W. Wiesemann, “Distributionally robust optimization,”Acta Numerica, vol. 34, pp. 579–804, 2025
work page 2025
-
[27]
Sinkhorn distributionally robust optimiza- tion,
J. Wang, R. Gao, and Y . Xie, “Sinkhorn distributionally robust optimiza- tion,”Operations Research, 2025
work page 2025
-
[28]
Villaniet al.,Optimal transport: old and new
C. Villaniet al.,Optimal transport: old and new. Springer, 2009, vol. 338
work page 2009
-
[29]
Kallenberg,Foundations of modern probability, 2nd ed
O. Kallenberg,Foundations of modern probability, 2nd ed. Springer- Verlag, New York, 2002
work page 2002
-
[30]
Wasserstein distributionally robust optimization: Theory and appli- cations in machine learning,
D. Kuhn, P. M. Esfahani, V . A. Nguyen, and S. Shafieezadeh-Abadeh, “Wasserstein distributionally robust optimization: Theory and appli- cations in machine learning,” inOperations research & management science in the age of analytics. Informs, 2019, pp. 130–166
work page 2019
-
[31]
S. Shafieezadeh-Abadeh, L. Aolaritei, F. D ¨orfler, and D. Kuhn, “New perspectives on regularization and computation in optimal transport-based distributionally robust optimization,”arXiv preprint arXiv:2303.03900, 2023
-
[32]
On linear optimization over Wasserstein balls,
M.-C. Yue, D. Kuhn, and W. Wiesemann, “On linear optimization over Wasserstein balls,”Mathematical Programming, vol. 195, no. 1, pp. 1107–1122, 2022
work page 2022
-
[33]
Optimization of conditional value- at-risk,
R. T. Rockafellar and S. Uryasev, “Optimization of conditional value- at-risk,”Journal of risk, vol. 2, pp. 21–42, 2000
work page 2000
-
[34]
R. Cescon, A. Martin, and G. Ferrari-Trecate, “On the global optimality of linear policies for Sinkhorn distributionally robust linear quadratic control,”arXiv preprint arXiv:2509.00956, 2025
-
[35]
A design of discrete-time integral controllers with computation delays via loop transfer recovery,
T. Ishihara, H.-J. Guo, and H. Takeda, “A design of discrete-time integral controllers with computation delays via loop transfer recovery,” Automatica, vol. 28, no. 3, pp. 599–603, 1992. [36]U.S. Military Handbook, U.S. Department of Defense, 1997
work page 1992
-
[36]
Creating a unified graphical wind turbulence model from multiple specifications,
S. Gage, “Creating a unified graphical wind turbulence model from multiple specifications,” inAIAA Modeling and simulation technologies conference and exhibit, 2003, p. 5529
work page 2003
-
[37]
R. T. Rockafellar and R. J.-B. Wets,Variational analysis. Springer Science & Business Media, 2009, vol. 317
work page 2009
-
[38]
Billingsley,Convergence of probability measures
P. Billingsley,Convergence of probability measures. John Wiley & Sons, 2013
work page 2013
-
[39]
J. Munkres,Topology, ser. Featured Titles for Topology. Prentice Hall, Incorporated, 2000
work page 2000
-
[40]
Entropy-regularized 2- Wasserstein distance between Gaussian measures,
A. Mallasto, A. Gerolin, and H. Q. Minh, “Entropy-regularized 2- Wasserstein distance between Gaussian measures,”Information Geom- etry, vol. 5, no. 1, pp. 289–323, 2022
work page 2022
-
[41]
Lecture notes on information theory,
Y . Polyanskiy and Y . Wu, “Lecture notes on information theory,”Lecture Notes for ECE563 (UIUC) and, vol. 6, no. 2012-2016, p. 7, 2014
work page 2012
-
[42]
Wasserstein metric and subordination,
P. Cl ´ement and W. Desch, “Wasserstein metric and subordination,” Studia Mathematica, vol. 1, no. 189, pp. 35–52, 2008
work page 2008
-
[43]
A system-level approach to controller synthesis,
Y .-S. Wang, N. Matni, and J. C. Doyle, “A system-level approach to controller synthesis,”IEEE Transactions on Automatic Control, vol. 64, no. 10, pp. 4079–4093, 2019
work page 2019
-
[44]
M. Sion, “On general minimax theorems,”Pacific Journal of Mathemat- ics, vol. 8, no. 1, pp. 171–176, 1958
work page 1958
-
[45]
S. Boyd and L. Vandenberghe,Convex optimization. Cambridge university press, 2004. APPENDIXI AUXILIARYRESULTS We review some well-known facts from measure theory and variational analysis that are used to prove our results. Later in the section we also present a brief overview of the topology of the Wasserstein space. Definition 6(Lower limit and lower sem...
work page 2004
-
[46]
The affine subspace defined by I−ZA−ZB Φxx Φxy Φux Φuy = I0 ,(23a) Φxx Φxy Φux Φuy I−ZA −C = I 0 ,(23b) I−ZA−ZB ϕx ϕu = 0,(23c) parameterizes all possible system responses(20)
-
[47]
For any vectors{ϕ x,ϕ u}and block lower-triangular ma- trices{Φ xx,Φ xy,Φ ux,Φ uy}satisfying(23), the affine controller in(10)withK=Φ uy −Φ uxΦ−1 xx Φxy and v=ϕ u −Φ uyCϕx +Φ uxΦ−1 xx ΦxyCϕx achieves the desired response. Proof.Proof of part 1. LetKbe any block lower-triangular operator andvany real vector. We verify (23a) by using the definitions ofΦ xx ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.