Variance Adjusted Actor Critic Algorithms

Aviv Tamar; Shie Mannor

arxiv: 1310.3697 · v1 · pith:E3WKC5ZUnew · submitted 2013-10-14 · 📊 stat.ML · cs.LG· cs.SY

Variance Adjusted Actor Critic Algorithms

Aviv Tamar , Shie Mannor This is my paper

classification 📊 stat.ML cs.LGcs.SY

keywords actor-criticcriticfunctionobjectivevariance-adjustedactoradjustedalgorithm

0 comments

read the original abstract

We present an actor-critic framework for MDPs where the objective is the variance-adjusted expected return. Our critic uses linear function approximation, and we extend the concept of compatible features to the variance-adjusted setting. We present an episodic actor-critic algorithm and show that it converges almost surely to a locally optimal point of the objective function.

This paper has not been read by Pith yet.

Variance Adjusted Actor Critic Algorithms

discussion (0)