pith. sign in

arxiv: 1310.3697 · v1 · pith:E3WKC5ZUnew · submitted 2013-10-14 · 📊 stat.ML · cs.LG· cs.SY

Variance Adjusted Actor Critic Algorithms

classification 📊 stat.ML cs.LGcs.SY
keywords actor-criticcriticfunctionobjectivevariance-adjustedactoradjustedalgorithm
0
0 comments X
read the original abstract

We present an actor-critic framework for MDPs where the objective is the variance-adjusted expected return. Our critic uses linear function approximation, and we extend the concept of compatible features to the variance-adjusted setting. We present an episodic actor-critic algorithm and show that it converges almost surely to a locally optimal point of the objective function.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.