A constrained optimization perspective on actor critic algorithms and application to network routing

H.L. Prasad; Prakash Chandra; Prashanth L.A.; Shalabh Bhatnagar

arxiv: 1507.07984 · v1 · pith:N5W7WYPHnew · submitted 2015-07-28 · 💻 cs.LG · math.OC

A constrained optimization perspective on actor critic algorithms and application to network routing

Prashanth L.A. , H.L. Prasad , Shalabh Bhatnagar , Prakash Chandra This is my paper

classification 💻 cs.LG math.OC

keywords actoralgorithmsapplicationnetworkoptimizationroutingactor-criticalgorithm

0 comments

read the original abstract

We propose a novel actor-critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process. The actor incorporates a descent direction that is motivated by the solution of a certain non-linear optimization problem. We also discuss an extension to incorporate function approximation and demonstrate the practicality of our algorithms on a network routing application.

This paper has not been read by Pith yet.

A constrained optimization perspective on actor critic algorithms and application to network routing

discussion (0)