Central-limit approach to risk-aware Markov decision processes
classification
🧮 math.OC
cs.SYeess.SY
keywords
riskapproachdecisionmarkovpolicyprocessesalgorithmassociated
read the original abstract
Whereas classical Markov decision processes maximize the expected reward, we consider minimizing the risk. We propose to evaluate the risk associated to a given policy over a long-enough time horizon with the help of a central limit theorem. The proposed approach works whether the transition probabilities are known or not. We also provide a gradient-based policy improvement algorithm that converges to a local optimum of the risk objective.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.