
arxiv: 1003.3418 · v1 · submitted 2010-03-17 · 💻 cs.DS

Exponential Lower Bounds For Policy Iteration

keywords: bounds, iteration, lower, policy, decision, exponential, markov, processes

We study policy iteration for infinite-horizon Markov decision processes. It has recently been shown that policy-iteration-style algorithms have exponential lower bounds in a two-player game setting. We extend these lower bounds to Markov decision processes with the total-reward and average-reward optimality criteria.
