pith. sign in

arxiv: 1512.00583 · v1 · pith:6WV4JRWKnew · submitted 2015-12-02 · 🧮 math.OC · cs.SY· eess.SY

Central-limit approach to risk-aware Markov decision processes

classification 🧮 math.OC cs.SYeess.SY
keywords riskapproachdecisionmarkovpolicyprocessesalgorithmassociated
0
0 comments X
read the original abstract

Whereas classical Markov decision processes maximize the expected reward, we consider minimizing the risk. We propose to evaluate the risk associated to a given policy over a long-enough time horizon with the help of a central limit theorem. The proposed approach works whether the transition probabilities are known or not. We also provide a gradient-based policy improvement algorithm that converges to a local optimum of the risk objective.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.