On the convergence of optimistic policy iteration for stochastic shortest path problem

Yuanlong Chen

arxiv: 1808.08763 · v2 · pith:ZLJ4E4JMnew · submitted 2018-08-27 · 💻 cs.LG · stat.ML

On the convergence of optimistic policy iteration for stochastic shortest path problem

Yuanlong Chen This is my paper

classification 💻 cs.LG stat.ML

keywords policyconvergenceiterationoptimisticpathproblemshorteststochastic

0 comments

read the original abstract

In this paper, we prove some convergence results of a special case of optimistic policy iteration algorithm for stochastic shortest path problem. We consider both Monte Carlo and $TD(\lambda)$ methods for the policy evaluation step under the condition that the termination state will eventually be reached almost surely.

This paper has not been read by Pith yet.

On the convergence of optimistic policy iteration for stochastic shortest path problem

discussion (0)