Generalised Entropy MDPs and Minimax Regret

Christos Dimitrakakis; Emmanouil G. Androulakis

arxiv: 1412.3276 · v1 · pith:K5A2OPUZnew · submitted 2014-12-10 · 💻 cs.LG · stat.ML

Generalised Entropy MDPs and Minimax Regret

Emmanouil G. Androulakis , Christos Dimitrakakis This is my paper

classification 💻 cs.LG stat.ML

keywords banditbayesianbeliefsconsiderdiscoverdiscussentropyextend

0 comments

read the original abstract

Bayesian methods suffer from the problem of how to specify prior beliefs. One interesting idea is to consider worst-case priors. This requires solving a stochastic zero-sum game. In this paper, we extend well-known results from bandit theory in order to discover minimax-Bayes policies and discuss when they are practical.

This paper has not been read by Pith yet.

Generalised Entropy MDPs and Minimax Regret

discussion (0)