An online algorithm for zero-sum LQ games with unknown dynamics combines model estimation and surrogate selection to achieve regret bounds on policy convergence.
Mechanism design theory in control engineering: A tutorial and overview of applications in communication, power grid, transportation, and security systems
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
eess.SY 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
An Online Learning Approach for Two-Player Zero-Sum Linear Quadratic Games
An online algorithm for zero-sum LQ games with unknown dynamics combines model estimation and surrogate selection to achieve regret bounds on policy convergence.