pith. sign in

arxiv: 1207.1415 · v1 · pith:IBYGPU36new · submitted 2012-07-04 · 💻 cs.AI

Approximate Linear Programming for First-order MDPs

classification 💻 cs.AI
keywords first-ordertechniqueallowsapplyapproximatedomainfomdpslinear
0
0 comments X
read the original abstract

We introduce a new approximate solution technique for first-order Markov decision processes (FOMDPs). Representing the value function linearly w.r.t. a set of first-order basis functions, we compute suitable weights by casting the corresponding optimization as a first-order linear program and show how off-the-shelf theorem prover and LP software can be effectively used. This technique allows one to solve FOMDPs independent of a specific domain instantiation; furthermore, it allows one to determine bounds on approximation error that apply equally to all domain instantiations. We apply this solution technique to the task of elevator scheduling with a rich feature space and multi-criteria additive reward, and demonstrate that it outperforms a number of intuitive, heuristicallyguided policies.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.