pith. sign in

arxiv: 1403.4175 · v1 · pith:LX4LLBMTnew · submitted 2014-03-17 · 💻 cs.SY · math.OC

Approximate Dynamic Programming based on Projection onto the (min,+) subsemimodule

classification 💻 cs.SY math.OC
keywords approximatedynamicprogrammingprojectionsubsemimodulealgorithmbellmandevelop
0
0 comments X
read the original abstract

We develop a new Approximate Dynamic Programming (ADP) method for infinite horizon discounted reward Markov Decision Processes (MDP) based on projection onto a subsemimodule. We approximate the value function in terms of a $(\min,+)$ linear combination of a set of basis functions whose $(\min,+)$ linear span constitutes a subsemimodule. The projection operator is closely related to the Fenchel transform. Our approximate solution obeys the $(\min,+)$ Projected Bellman Equation (MPPBE) which is different from the conventional Projected Bellman Equation (PBE). We show that the approximation error is bounded in its $L_\infty$-norm. We develop a Min-Plus Approximate Dynamic Programming (MPADP) algorithm to compute the solution to the MPPBE. We also present the proof of convergence of the MPADP algorithm and apply it to two problems, a grid-world problem in the discrete domain and mountain car in the continuous domain.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.