Dynamic Programming for Structured Continuous Markov Decision Problems

Nicolas Meuleau; Richard Dearden; Richard Washington; Zhengzhu Feng

arxiv: 1207.4115 · v1 · pith:BBUYKZZQnew · submitted 2012-07-11 · 💻 cs.AI


classification 💻 cs.AI

keywords: approach, continuous, decision, describe, dynamic, efficiently, linear, markov


We describe an approach for exploiting structure in Markov Decision Processes with continuous state variables. At each step of dynamic programming, the state space is dynamically partitioned into regions within which the value function is constant. We first describe the algorithm for piecewise constant representations. We then extend it to piecewise linear representations, using techniques from POMDPs to represent and reason about linear surfaces efficiently. We show that for complex, structured problems, our approach exploits the natural structure so that optimal solutions can be computed efficiently.
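To make the piecewise constant idea concrete, here is a minimal sketch of one dynamic programming backup on a toy problem. It is not the authors' algorithm: it assumes a single continuous state variable (a remaining resource in [0, X_MAX)), deterministic actions that each consume a fixed amount and yield a fixed reward, and exact arithmetic on region boundaries. The names `evaluate`, `backup`, `X_MAX`, and `actions` are hypothetical and chosen only for illustration.

```python
from bisect import bisect_right

# Assumed toy setting (not from the paper): one continuous state variable x,
# the remaining resource, with 0 <= x < X_MAX.
X_MAX = 10

def evaluate(pieces, x):
    """Value of a piecewise constant function at x.

    pieces is a list of (left_edge, value) pairs sorted by left_edge,
    starting at 0; each value holds on [left_edge, next left_edge).
    """
    edges = [edge for edge, _ in pieces]
    return pieces[bisect_right(edges, x) - 1][1]

def backup(pieces, actions):
    """One Bellman backup with a dynamically recomputed partition.

    actions is a list of (cost, reward) pairs. An action is feasible when
    x >= cost and moves the state to x - cost. Stopping is always allowed
    and is worth 0. Boundaries are assumed exact (ints or Fractions);
    floats could misplace a boundary point.
    """
    # The new value can only change where an action becomes feasible or
    # where a shifted boundary of the old value function falls.
    edges = {0}
    for cost, _ in actions:
        for edge, _ in pieces:
            if edge + cost < X_MAX:
                edges.add(edge + cost)

    new_pieces = []
    for left in sorted(edges):
        best = 0.0  # value of stopping
        for cost, reward in actions:
            if left >= cost:
                best = max(best, reward + evaluate(pieces, left - cost))
        # Merge with the previous region when the value does not change.
        if not new_pieces or new_pieces[-1][1] != best:
            new_pieces.append((left, best))
    return new_pieces

actions = [(2, 1.0), (3, 2.0)]      # hypothetical (cost, reward) pairs
v0 = [(0, 0.0)]                     # horizon 0: nothing left to gain
v1 = backup(v0, actions)            # [(0, 0.0), (2, 1.0), (3, 2.0)]
v2 = backup(v1, actions)            # [(0, 0.0), (2, 1.0), (3, 2.0), (5, 3.0), (6, 4.0)]
```

In this sketch the partition is never fixed in advance: each backup derives candidate boundaries from the previous value function, evaluates each region once at its left edge, and merges neighbours that end up with the same value. That is why the candidate boundary at x = 4 disappears from `v2`.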

