The proof is in Section B.6.4

the policy K =0is( κ, γ)-strongly stable, ensures thatP(α⊤ j zK,w t ≤β−ϵ ) ≥ 1 −δ for all j∈[J], t∈[T], 8

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Rate-Optimal Regret for the Safe Learning-based Control of the Constrained Linear Quadratic Regulator

math.OC · 2026-04-24 · unverdicted · novelty 8.0

An algorithm for constrained stochastic LQR achieves tilde O of square root T regret and chance constraint satisfaction via SDP-based optimistic policies scaled for safety.

citing papers explorer

Showing 1 of 1 citing paper.

Rate-Optimal Regret for the Safe Learning-based Control of the Constrained Linear Quadratic Regulator math.OC · 2026-04-24 · unverdicted · none · ref 4
An algorithm for constrained stochastic LQR achieves tilde O of square root T regret and chance constraint satisfaction via SDP-based optimistic policies scaled for safety.

The proof is in Section B.6.4

fields

years

verdicts

representative citing papers

citing papers explorer