A Linearly Relaxed Approximate Linear Program for Markov Decision Processes

Chandrashekar Lakshminarayanan; Csaba Szepesvari; Shalabh Bhatnagar

arxiv: 1704.02544 · v1 · pith:4A4TXNZHnew · submitted 2017-04-09 · 💻 cs.SY

A Linearly Relaxed Approximate Linear Program for Markov Decision Processes

Chandrashekar Lakshminarayanan , Shalabh Bhatnagar , Csaba Szepesvari This is my paper

classification 💻 cs.SY

keywords linearconstraintsnumberapproximatedecisionlinearlylralpmarkov

0 comments

read the original abstract

Approximate linear programming (ALP) and its variants have been widely applied to Markov Decision Processes (MDPs) with a large number of states. A serious limitation of ALP is that it has an intractable number of constraints, as a result of which constraint approximations are of interest. In this paper, we define a linearly relaxed approximation linear program (LRALP) that has a tractable number of constraints, obtained as positive linear combinations of the original constraints of the ALP. The main contribution is a novel performance bound for LRALP.

This paper has not been read by Pith yet.

A Linearly Relaxed Approximate Linear Program for Markov Decision Processes

discussion (0)