pith. sign in

arxiv: 1511.08456 · v1 · pith:XDTWAKTQnew · submitted 2015-11-26 · 💻 cs.AI

A Symbolic SAT-based Algorithm for Almost-sure Reachability with Small Strategies in POMDPs

classification 💻 cs.AI
keywords problemstrategiessymbolicalgorithmalmost-surefirstpomdpsreachability
0
0 comments X
read the original abstract

POMDPs are standard models for probabilistic planning problems, where an agent interacts with an uncertain environment. We study the problem of almost-sure reachability, where given a set of target states, the question is to decide whether there is a policy to ensure that the target set is reached with probability 1 (almost-surely). While in general the problem is EXPTIME-complete, in many practical cases policies with a small amount of memory suffice. Moreover, the existing solution to the problem is explicit, which first requires to construct explicitly an exponential reduction to a belief-support MDP. In this work, we first study the existence of observation-stationary strategies, which is NP-complete, and then small-memory strategies. We present a symbolic algorithm by an efficient encoding to SAT and using a SAT solver for the problem. We report experimental results demonstrating the scalability of our symbolic (SAT-based) approach.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.