A novel catalyst scheme for stochastic minimax optimization. arXiv preprint arXiv:2311.02814, 2023
1 Pith paper cites this work. Polarity classification is still indexing.
fields: math.OC (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Robust Markov Decision Processes on Continuous State Spaces
Develops stochastic first-order methods for robust policy evaluation and approximate policy iteration in continuous-state robust MDPs, achieving Õ(1/ε²) sample complexity for both evaluation and optimization.