Nonapproximability Results for Partially Observable Markov Decision Processes

C. Lusena; J. Goldsmith; M. Mundhenk

arxiv: 1106.0242 · v1 · pith:X4EABM22new · submitted 2011-06-01 · 💻 cs.AI

Nonapproximability Results for Partially Observable Markov Decision Processes

J. Goldsmith , C. Lusena , M. Mundhenk This is my paper

classification 💻 cs.AI

keywords collapsesconstantdecisionfindingguaranteesmarkovobservablepartially

0 comments

read the original abstract

We show that for several variations of partially observable Markov decision processes, polynomial-time algorithms for finding control policies are unlikely to or simply don't have guarantees of finding policies within a constant factor or a constant summand of optimal. Here "unlikely" means "unless some complexity classes collapse," where the collapses considered are P=NP, P=PSPACE, or P=EXP. Until or unless these collapses are shown to hold, any control-policy designer must choose between such performance guarantees and efficient computation.

This paper has not been read by Pith yet.

Nonapproximability Results for Partially Observable Markov Decision Processes

discussion (0)