pith. sign in

arxiv: 1605.02099 · v1 · pith:KOJLXCXNnew · submitted 2016-05-06 · 💻 cs.LG

Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms

classification 💻 cs.LG
keywords algorithmsemphaticlearningresultssimulationsometemporal-differenceanalysis
0
0 comments X
read the original abstract

This is a companion note to our recent study of the weak convergence properties of constrained emphatic temporal-difference learning (ETD) algorithms from a theoretic perspective. It supplements the latter analysis with simulation results and illustrates the behavior of some of the ETD algorithms using three example problems.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.