Develops stochastic first-order methods for robust policy evaluation and approximate policy iteration in continuous-state robust MDPs, achieving 'O(1/ε^{2}) sample complexity for both evaluation and optimization.
Towards general function approximation in zero-sum Markov games.arXiv preprint arXiv:2107.14702, 2021
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
math.OC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Robust Markov Decision Processes on Continuous State Spaces
Develops stochastic first-order methods for robust policy evaluation and approximate policy iteration in continuous-state robust MDPs, achieving 'O(1/ε^{2}) sample complexity for both evaluation and optimization.