B\"uchi Objectives in Countable MDPs

Mahsa Shirmohammadi; Patrick Totzke; Richard Mayr; Stefan Kiefer

arxiv: 1904.11573 · v2 · pith:ACXKPI77new · submitted 2019-04-25 · 🧮 math.PR · cs.FL· math.OC

B\"uchi Objectives in Countable MDPs

Stefan Kiefer , Richard Mayr , Mahsa Shirmohammadi , Patrick Totzke This is my paper

classification 🧮 math.PR cs.FLmath.OC

keywords markovstrategiesobjectivesonlyquestionuchialwaysanswer

0 comments

read the original abstract

We study countably infinite Markov decision processes with B\"uchi objectives, which ask to visit a given subset of states infinitely often. A question left open by T.P. Hill in 1979 is whether there always exist $\varepsilon$-optimal Markov strategies, i.e., strategies that base decisions only on the current state and the number of steps taken so far. We provide a negative answer to this question by constructing a non-trivial counterexample. On the other hand, we show that Markov strategies with only 1 bit of extra memory are sufficient.

This paper has not been read by Pith yet.

B\"uchi Objectives in Countable MDPs

discussion (0)